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Abstract 

The long code is a central tool in hardness of approximation, especially in 
questions related to the unique games conjecture. We construct a new code that 
is exponentially more efficient, but can still be used in many of these applications. 
Using the new code we obtain exponential improvements over several known re- 
sults, including the following: 

1 . For any e > 0, we show the existence of an n vertex graph G where every 
set of o(n) vertices has expansion 1 - e, but G's adjacency matrix has more 
than exp(log'^ n) eigenvalues larger than 1 - e, where 6 depends only on s. 
This answers an open question of Arora, Barak and Steurer (FOCS 2010) 
who asked whether one can improve over the noise graph on the Boolean 
hypercube that has poly(logn) such eigenvalues. 

2. A gadget that reduces unique games instances with linear constraints modulo 
K into instances with alphabet k with a blowup of K'^°^^^°s('^\ improving over 
the previously known gadget with blowup of 2"'^'. 

3. An « variable integrality gap for Unique Games that that survives 
exp(poly(log log «)) rounds of the SDP + Sherali Adams hierarchy, improv- 
ing on the previously known bound of poly(log log «). 

We show a connection between the local testability of linear codes and small set 
expansion in certain related Cayley graphs, and use this connection to derandomize 
the noise graph on the Boolean hypercube. 
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1 Introduction 



Khot's Unique Games Conjecture [Kho02] (UGC) has been the focus of intense research 
effort in the last few years. The conjecture posits the hardness of approximation for a 
certain constraint satisfaction problem, and shows promise to settle many open questions 
in theory of approximation algorithms. Specifically, an instance T of the Unique Games 
problem with n variables and alphabet E is described by a collection of constraints of the 
form (x, y, n) where tt is a permutation over X. An assignment to F is a mapping / from 
[«] to E, and /'s value is the fraction of constraints (x, y, n) such that f{y) = n{f{x)). 
The Unique Games Conjecture is that for any £ > 0, there is some finite S such that it is 
NP hard to distinguish between the case that a Unique Games instance F with alphabet 
F has an assignment satisfying 1 - e fraction of the constraints, and the case that every 
assignment satisfies at most s fraction of F's constraint. 

Many works have been devoted to studying the plausibility of the UGC, as well as 
exploring its implications and obtaining unconditional results motivated by this effort. 
Tantalizingly, at the moment we have very little evidence for the truth of this conjec- 
ture. One obvious reason to believe the UGC is that no algorithm is known to contradict 
it, though that of course may have more to do with our proof techniques for algorithm 
analysis than actual computational difficulty. Thus perhaps the strongest evidence for 
the conjecture comes from results showing particular instances on which certain nat- 
ural algorithms will fail to solve the problem. However, even those integrality gaps 
are quantitatively rather weak. For example, while Arora, Barak and Steurer [ABSIO] 
showed a subexponential upper bound on an algorithm for the Unique Games and the 
related Small-Set Expansion problem, the hardest known instances for their algorithm 
only required quasipolynomial time [KoUO]. Similarly (and related to this), known 
integrality gaps for Unique Games and related problems do not rule out their solution 
by an C?(log ?i)-round semidefinite hierarchy, an algorithm that can be implemented in 
quasipolynomial (or perhaps even polynomial [BRS 1 1]) time. 

The long code has been a central tool in many of these works. This is the set of 
"dictator" functions mapping to F2 that have the form xi . . .x^ ^ xt for some /. 
Many hardness reductions (especially from Unique Games) and constructions of inte- 
grality gap instances use the long code as a tool. However, this is also the source of their 
inefficiency, as the long code is indeed quite long. Specifically, it has only N codewords 
but dimension 2^, which leads to exponential blowup in many of these applications.In 
this work, we introduce a different code, which we call the "short code", that is ex- 
ponentially more efficient, and can be used in the long code's place in many of these 
applications, leading to significant quantitative improvements. In particular, we use our 
code to show instances on which the [ABS 10] algorithm, as well as certain semidefi- 
nite hierarchies, require almost sub-exponential time, thus considerably strengthening 
the known evidence in support of the Unique Games Conjecture. Moreover, our results 
open up possibilities for qualitative improvements as well, in particular suggesting a new 
approach to prove the Unique Games Conjecture via an efficient alphabet reduction. 
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1.1 Our results 



At the heart of the long code's appUcations He its connection with the noisy Hypercube. 
This is the weighted graph Hn^b whose vertices are elements in where a random 
neighbor of ;c e is obtained by flipping each bit of x independently with probability 
s. ^ It is not too hard to show that the codewords of the long code correspond to the top 
eigenvectors of the noisy hypercube which also give the minimal bisections of the graph, 
cutting only an e fraction of edges. In addition, several converse results are known, 
showing that bisections (and more general functions) cutting few edges are close to these 
top eigenvectors (or dictatorships) in some sense. (One such result is the "Majority is 
Stablest" Theorem of [MOO05].) The inefficiency of the longcode is manifested in the 
fact that the number of vertices of the noisy cube is exponential in the number N of its 
top eigenvectors. 

The short code. Another way to describe the long code is that it encodes x e F^hy 
a binary vector Vx of length 2^ where Vx{f) = fix) for every function / : F2 — > F2. 
This view also accounts for the name "long code", since one can see that this is the 
longest possible encoding of x without having repeated coordinates. For every subset D 
of functions mapping F2 to F2, we define the D-short code to be the code that encodes 
;c by a vector Vx of length \D\ where Vxif) = f{x) for every f e D. Note that this is 
a very general definition that encapsulates any code without repeated coordinates. For 
€ N, we define the d-short code to be the the 2)-short code where D is the set of all 
polynomials over of degree at most d. Note that the 1 -short code is the Hadamard 
code, while the ?i-short code is the long code. We use the name "short code" to denote 
the d short code for d = 0(1). Note that the short code has 2" codewords and dimension 
roughly 2" , and hence only quasipolynomial blowup, as opposed to the exponential 
blowup of the long code. Our main contribution is a construction of a "derandomized" 
noisy cube, which is a small subgraph of the noisy cube that enjoys the same relations 
to the short code (including a "Majority is Stablest" theorem) as the original noisy cube 
has to the long code. As a result, in many applications one can use the short code 
and the derandomized cube in place of the long code and the noisy cube, obtaining an 
exponential advantage. Using this approach we obtain the following results: 

Small set expanders with many large eigenvalues. Our first application, and the 
motivation to this work, is a question of Arora, Barak and Steurer [ABSIO]: How many 
eigenvectors with eigenvalue at least 1 - e can an n-vertex small set expander graph 
have? We say a graph is a small set expander (SSE) if all sufficiently small subsets of 
vertices have, say, at least 0.9 fracton of their neighbors outside the set. [ABS 10] showed 
an upper bound of n^^^^ on the number of large (i.e., greater than 1 - e) eigenvalues of a 
small set expander. Arora et al. then observed that the subspace enumeration algoriffim 
of [KT07, KollO] for approximating small set expansion in an input graph takes time at 
most exponential in this number, which they then use to give an algorithm with similar 

' This graph is closely related and has similar properties to the unweighted graph where we connect x 
and y if their Hamming distance is at most eA'. 
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running time for the Unique Games problem. Up to this work, the best lower bound was 
polylog(n), with the example being the noisy cube, and hence as far as we knew the 
algorithm of [ABSIO] could solve the small set expansion problem in quasipolynomial 
time, which in turn might have had significant implications for the Unique Games prob- 
lem as well. Our derandomized noisy cube yields an example with an almost polynomial 
number of large eigenvalues: 

Theorem 1. For every e > 0, there is an n-vertex small set expander graph with 
2(iog«)"<" eigenvectors with corresponding eigenvalues at least 1 - e. 

Theorem 1 actually follows from a more general result connecting locally testable 
codes to small set expanders, which we instantiate with the Reed Muller code. See 
Section 2 for details. 

Efficient integraUty gaps. There is a standard semidefinite program (SDP) relaxation 
for the Unique Games problem, known as the "basic SDP" [KV05, RS09]. Several 
works have shown upper and lower bounds on the approximation guarantees of this 
relaxation, and for constant alphabet size, the relation between the alphabet size and 
approximation guarantee is completely understood [CMM06]. However, for unbounded 
alphabet, there was still a big gap in our understanding of the relation between the ap- 
proximation guarantee and the number of variables. Gupta and Talwar [GT06] showed 
that if the relaxation's value is 1 - e, there is an assignment satisfying 1 - 0(£log?i) 
fraction of constraints. On the other hand, Khot and Vishnoi [KV05] gave an integrality 
gap instance where the relaxation's value was 1 - l/poly(loglog?i)^ but the objective 
value (maximum fraction of constraints satisfied by any assignment) was o(l). It was 
a natural question whether this could be improved (e.g., see [Leell]), and indeed our 
short code allows us to obtain an almost exponential improvement: 

Theorem 2. There is an n-variable instance of Unique Games with objective value o(l) 
but for which the standard semidefinite programming (SDP) relaxation has value at least 
1 - l/qpolylog(?i).^ 

IntegraUty gaps for SDP hierarchies. Our best evidence for the hardness of the 
Unique Games Conjecture comes from integrality gap instances for semidefinite pro- 
gramming hierarchies. These are strengthened versions of the basic SDP where one 
obtains tighter relaxations by augmenting them with additional constraints, we refer 
to [CTIO] for a good overview of SDP hierarchies. These hierarchies are generally 
paramaterized by a number r (often called the number of rounds), where the first round 
corresponds to the Basic SDP, and the n''' round (where n is the instance size) corre- 
sponds to the exponential brute force algorithm that always computes an optimal answer. 
Generally, the r''' -round of each such hierarchy can be evaluated in n'^^''^ time (though in 

^Throughout, for any function /, poly(/(;!)) denotes a function g satisfying g(n) = f(n)^'-'\ 
^For functions /, g : N — > [0, oo) we write / = qpoly((^) if / = exp(polylog(gf)). That is, if there are 
constants C > f > such that for all sufficiently large n, exp((log g{n)y) < f(n) < exp((log g(n))'- ). (Note 
that we allow c < 1, and so / = qpoly(g') does not imply that / > g.) Similarly, we define qpolylog((;) = 
qpoly(logg) and write / = qqpoly(g) if / = exp(exp(poly(loglogg))). 
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some cases n'^'-^^l'^'-''^ time suffices [BRSll]). In this paper we consider two versions of 
these hierarchies — the SA hierarchy and the weaker LH hierarchy. Loosely speaking, 
the r'^ round of the SA hierarchy adds the constraints of the r'^ round of the Sherali- 
Adams hnear programming hierarchy (see [SA90]) to the Basic SDP; the r"' round of 
the LH hierarchy augments the Basic SDP with the constraints that and subset of r vec- 
tors from the vector solutions embeds isometricaUy into the £i metric. (See Section 8 
and [RS09] for more details.) 

Barak, Raghavendra and Steurer [BRSll] (see also [GSll]) showed that for every 
e > 0, n'^ rounds of the SA hierarchy yields a non-trivial improvement over the basic 
SDP . The unique games conjecture predicts that this is optimal, in the sense that n"'-^'' 
rounds of any hierarchy should not improve the worst-case approximation ratio above 
the basic SDP.'^ However, this prediction is far from being verified, with the best lower 
bounds given by [RS09] (see also [KS09]) who showed instances that require log^^^' n 
rounds for the LH hierarchy, and (log log n)^^^^ rounds for the SA hierarchy. Moreover, 
these instances are known to be solvable in quasipolynomial time [KollO] and in fact 
via polylog(«) rounds of the SA hierarchy [BRSl 1] . Thus prior work gave no evidence 
that the unique games problem cannot be solved in quasipolynomial time. In this work 
we obtain almost-exponentially more efficient integrality gaps, resisting qpoly(log?i) 
rounds of the SA hierarchy and qqpoly(?i) rounds of the LH hierarchy. The latter is 
the first superlogarithmic SDP hierarchy lower bound for Unique Games for any SDP 
hierarchy considered in the literature. 

Theorem 3. For every s > there is some k = k{£), such that for every n there is an n 
variable instance T of Unique Games with alphabet size k such that the objective value 
ofY is at most e, but the value on F of both qpoly(log n) rounds of the SA hierarchy and 
qqpoly(?2) rounds of the LH hierarchy is at least \ — e. 

A corollary of the above theorem is a construction of an «-point metric of negative 
type such that all sets of size up to some k = qqpoly(?i) embed isometricaUy into £i 
but the whole metric requires qpolylog(?i) distortion to embed into fi. We remark that 
Theorem 3 actually yields a stronger result than stated here — as a function of k, our 
results (as was the case with the previous ones) obtain close to optimal gap between 
the objective value and the SDP value of these hierarchies; in particular we show that 
in the above number of rounds one cannot improve on the approximation factor of the 
Geomans-Williamson algorithm for Max Cut. It is a fascinating open question whether 
these results can be extended to the stronger Lasserre hierarchy. Some very recent re- 
sults of Barak, Harrow, Kelner, Steurer and Zhou [BHK^U] (obtained subsequent to 
this work), indicate that new ideas may be needed to do this, since the Unique Games 
instances constructed here and in prior works are not integrality gaps for some absolute 
constant rounds of the Lasserre hierarchy. 

Alphabet reduction gadget. Khot, Kindler, Mossel and O'Donnel [KKMO04] used 
the long code to show an "alphabet reduction" gadget for unique games. They show how 

''This is under the widely believed assumption that NP ^ Dtime(exp(n"*''). 
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to reduce a unique game instance with some large alphabet K to an instance with arbi- 
trarily small alphabet. (In particular, they showed how one can reduce arbitrary unique 
games instances into binary alphabet instances, which turns out to be equivalent to the 
Max Cut problem.) However, quantitatively their result was rather inefficient, incurring 
an exponential in K blowup of the instance. By replacing the long code with our "short 
code", we obtain a more efficient gadget, incurring only a qusipolynomial blowup. One 
caveat is that, because the short code doesn't support arbitrary permutations, this re- 
duction only works for unique games instances whose constraints are affine functions 
over where k = log K; however this class of unique games seems sufficiently rich for 
many applications.^ 

Theorem 4. For every s there are k, 6, and a reduction that for every I maps any n- 
variable Unique Games instance F whose constraints are affine permutations over al- 
phabet F2 into an n ■ exp{poly{£, k))-variable Unique Games instance T' of alphabet k, 
such that if the objective value of T is larger than 1-6, then the objective value of T' 
is larger than 1 - e, and if the objective value ofT is smaller than 6, then the objective 
value ofV is smaller than s. 

Once again, our quantitative results are stronger than stated here, and as 
in [KKMO04], we obtain nearly optimal relation between the alphabet size k and the 
soundness and completeness thresholds. In particular for ^ = 2 our results match the pa- 
rameters of the Max Cut algorithm of Geomans and Williamson. Our alphabet reduction 
gadget suggests a new approach to proving the unique games conjecture by using it as 
an "inner PCP". For example, one could first show hardness of unique games with very 
large alphabet (polynomial or even subexponential in the number of variables) and then 
applying alphabet reduction. At the very least, coming up with plausible hard instances 
for unique games should be easier with a large alphabet. 

Remark 1.1. The long code is also used as a tool in applications that do not involve 
the unique games conjecture. On a high level, there are two properties that make the 
long code useful in hardness of approximation: (i) it has a 2 query test obtained from 
the noisy hypercube and (ii) it has many symmetries, and in particular one can read off 
any function of x from the x'^' codeword. Our short code preserves property (i) but (as is 
necessary for a more efficient code) does not preserve property (ii), as one can only read 
off low degree polynomials of x (also it is only symmetric under affine transformations). 
We note that if one does not care about property (i) and is happy with a 3 query test, then 
it's often possible to use the Hadamard code which is more efficient than the short code 
(indeed it's essentially equal to the <i-short code for d = I). Thus, at least in the context 
of hardness of approximation, it seems that the applications the short code will be most 
useful are those where property (i) is the crucial one. 

Despite the name "short code", our code is not the shortest possible code. While in 
our applications, dimension linear in the number of codewords is necessary (e.g., one 
can't have a graph with more eigenvalues than vertices), it's not clear that the dimension 

^For example, because the multiplicative group of the field Fan is cyclic, one can represent constraints 
of the form x,- - x, = c,j (mod 2" - 1) as linear constraints over (i.e., constraints of the form x, = CijXj 
where C,j is an invertible linear map over F^. 
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needs to be polynomial. It is a very interesting open question to find shorter codes that 
can still be used in the above applications. 



2 Our techniques 

To explain our techniques we focus on our first application — the construction of a small 
set expander with many eigenvalues close to 1. The best way to view this construction 
is as a derandomization of the noisy hypercube, and so it will be useful to recall why the 
noisy hypercube itself is a small set expander. 

Recall that the e-noisy hypercube is the graph Hn^e whose vertex set is {±1}'^ where 
we sample a neighbor of x by flipping each bit independently with probability s. The 
eigenvectors in //ats are given by the parity functions ;t'a(;c) = n/eo-^r for subsets a c 
[A'^] and the corresponding eigenvalues are /Iq, = ( 1 - 2sj"^ . Thus Aa only depends on 
the degree \a\ ofxa- In particular, the "dictator" functions = x, have eigenvalue 

I - 2e and they correspond to balanced cuts (where vertices are partitioned based on the 
value of Xi) with edge expansion e. As a increases, Aa decreases, becoming a constant 
around \a\ = 0(l/e). 

Given / : {+1}^ {0, 1) which is the indicator of a set 5, its Fourier expansion 
f{x) = Yja fio:)Xaix) can be viewed as expressing the vector / in the eigenvector basis. 
The edge expansion of S is determined by the distribution of its Fourier mass; sets where 
most of the Fourier mass is on large sets will expand well. Given this connection, small- 
set expansion follows from the fact that the indicator functions of small sets have most of 
their mass concentrated on large Fourier coefficients. More precisely a set S of measure 
lu has most of its Fourier mass on coefficients of degree Q(log(l///)). This follows from 
the so-called (2,4)-hypercontractive inequality for low-degree polynomials — that for 
every degree d polynomial /, 

E [/(x)4]<C E [fixff (2.1) 

.re{±l)'v .re{±l)'^ 

for some C depending only on d. (See Section 4.1 for the proof, though some intuition 
can be obtained by noting that if / is a characteristic function of a set S of measure 
ju = o(l) then E[/^]^ = /j.^ and E[/^] = ju and hence Equation (2. 1) shows that / cannot 
be an C?(l)-degree polynomial.) 

By a "derandomized hypercube" we mean a graph on much fewer vertices that still 
(approximately) preserves the above properties of the noisy hypercube. Specifically we 
want to find a very small subset 2) of {±1}^ and a subgraph G of Hni- whose vertex set 
is D such that (i) G will have similar eigenvalue profile to Hns, and in particular have 
A'^ eigenvalues close to 1 and (ii) G will be a a small set expander. To get the parameters 
we are looking for, we'll need to have the size of D be at most qpoly(A'^). 

A natural candidate is to take 2) to be a random set, but it is not hard to show that 
this will not work. A better candidate might be a linear subspace D c that looks 
suitably pseudorandom. We show that in fact it suffices to choose a subspace D whose 
dual C = T)^ is a sufficiently good locally testable code. (We identify with {+1}'^ 
via the usual map {bi,...,bN) ^ ((-1)*' , . . . , (-1)*").) 
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Our construction requires an asymptotic family of [N,K,D]2 linear codes C Q 
where the distance D tends to infinity. The code should have a eA/^-query local tester 
which when given a received word a e samples a codeword q of weight at most 
sN from a distribution T on C""" and accepts if {a, q) = I. The test clearly accepts 
codewords in C, we also require it to reject words that are distance at least D/10 from 
every codeword in C with probability 0.49. Given such a locally testable code C, we 
consider the Cayley graph** G whose vertices are the codewords of the dual code = C""" 
while the (appropriately weighted) edges correspond to the distribution T. That is, a 
vertex of G is a codeword x e D, while a random neighbor of x is obtained by picking a 
random q from 7~ and moving to x + ^. 

Because 2) is a subspace, it is easy to show that the eigenvectors of G are linear func- 
tions of of the form;^fQ,(x) for x,a e F^ (where if a®a' e C thenxo ^^'^Xa' are identical 
on G's vertices). Moreover, from the way we designed the graph, for every a e the 
corresponding eigenvalue Aa is equal to E^g7-[(- 1)^"'"'-'^] -1-2 P7-[Test rejects a]. This 
connection between the spectrum of G and the local testability of C allows us to invoke 
machinery from coding theory in our analysis. 

From this one can deduce that the eigenvalue spectrum of G does indeed resemble 
the hypercube in the range close to 1. In particular each;^j,j(x) = x,- is a distinct eigen- 
vector with eigenvalue 1 - 2s, and gives a bad cut in G (where vertices are partitioned 
based on the value of x,). On the other hand for any eigenvector of G, choose a of min- 
imal weight such that;^' = Xa- Now if \a\ > D/10 this means that the distance of a from 
C is at least D/10, which using the testing property implies that /Iq, < 1 - 2 • 0.49 = 0.02. 

If we can show that indicator functions of small sets have most of their Fourier mass 
on such eigenvectors (with small eigenvalue), that will imply that small sets have good 
expansion. For small subsets of the hypercube, recall that this is proved using (2,4)- 
hpercontractivity for low-degree polynomials. The key observation is that the inequality 

E [fix)'] < C E [fixff (2.2) 
xeV xeD 

Still holds for all polynomials / of degree d < D/4. This is because the distance of C is 
D, hence the distribution of a random x in 2) is D-wise independent, which means that 
the expectation of any polynomial of degree at most D is equal over such x and over a 
uniform x in {±1}^. Thus (2.2) follows from (2.1), completing our proof. 

We instantiate this approach with using for C the Reed Muller code consisting of 
polynomials in « variables over F2 of degree n-J-l. This is a code of distance D - 2"^"^ 
We note that the degree n-d-l and hence the rate of the code C are very high. The graph 
is over the codewords of D = C''~ that is itself the Reed Muller code of polynomials 
over Fj of degree d. Our basic tester consists of selecting a random minimum weight 
codeword of Thus our graph Q has as its vertices the d degree polynomials over ¥'2 
with an edge between every polynomials p, q such that p - q is a. product of d linearly- 
independent affine functions (as those are the minimal weight codewords in the Reed 

^Cayley graph are usually defined to be unweighted graph. However, the definition can be generalized 
straightforwardly to weighted graphs. 

'For many applications we amplify the success of this tester by selecting a sum of / random such words, 
this corresponds to taking some power of the basic graph described. 
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Muller code). We use the optimal analysis of Bhattacharyya, Kopparty, Schoenebeck, 
Sudan and Zuckerman [BKS"''10] to argue about the local testability of C which is a 
high degree Reed Muller code. We should note that this test is very closely related to the 
Gowers uniformity test that was first analyzed in the work of Kaufman et al. [AKK^05], 
but our application requires the stronger result from [BKS^IO]. 

2.1 Other applications 

We now briefly outline how we use the above tools to obtain more efficient versions of 
several other constructions such as alphabet reduction gadgets and integrality gaps for 
unique games and other problems. 

Efficient integrality gaps for Unique Games. To beign with, the graph we construct 
can be used to prove Theorem 2. That is, a construction of an M variable instance F 
of unique games where every assignment can satisfy at most a very small (say 1/100) 
fraction of the constraints, but for which the standard semidefinite programming (SDP) 
relaxation has value of at least 1 - 1/ qpoly(log M). The basic idea is to simply take 
the graph we constructed above, and turn it into an instance of unique games by 
considering it to be the label extended graph of some unique games instance. We now 
elaborate a bit below, leaving the full details to Section 7. Recall that a Unique Games 
instance F with M variables and alphabet E is described by a collection of constraints 
of the form {x, y, n) where tt is a permutation over X. An assignment to F is a mapping 
/ from [M] to E, and /'s value is the fraction of consti^aints (x, y, n) such that f{y) = 
n{f{x)). The label extended graph corresponding to F is the graph Gr over vertices 
[M] X E where for every constraint of the form (x, y, n) and cr € E we add an edge 
between (x, cr) and (y, JT(cr)). It is not hard to see that an assignment of value 1 - e 
corresponds to a subset S containing exactly M of Gr's vertices with small expansion 
(i.e., e fraction of the edges from S leave the set). Thus if Gr is an expander for sets of 
measure 1/|E| in Gp then there is no nearly satisfying assignment for the unique games 
instance F. In our case, our graph Q has the degree d polynomials over F'^ as its vertices, 
and we transform it into a unique game instance whose variables correspond to degree 
d polynomials without linear terms. The alphabet E consists of all linear functions 
over V^. We ensure that the graph Q is the label extended graph of F by setting the 
permutations accordingly: given a polynomial p without a hnear term, and a function q 
that is a product of d affine functions,^ if we write q = q' + q" where q" is the linear part 
of q, then we add a constraint of the form {p, p + q' , n) where n is the permutation that 
maps a linear function r into r + q" . Some not too difficult calculations show that the 
top eigenvectors of our graph Q yield a solution for the semidefinite program for F (if 
the top eigenvectors are /\ ... , our vector solution will associate with each vertex x 
the vector {f^{x), . . . ,/^(x)). By choosing carefully the parameters of the graph Q, the 
instance F will have SDP value 1 - 1 / qpoly(log M) where M is the number of variables. 

Actually, to get better parameters, we take some power t of meaning that we consider q that is a sum 
of t functions that are products of d affine functions. 
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Derandomized Invariance Principle. While hypercontractivity of low degree poly- 
nomials suffices for some applications of the long code, other applications require other 
theorems, and in particular the invariance principle, shown for the hypercube by Mos- 
sel, O'Donnel and Oleszkiewicz [MOO05]. Roughly speaking their invariance principle 
says that for "nice" functions / on the vertices of the A/^-dimensional noisy hypercube, 
the distribution of f(x) where ;c is a random vertex is close to the distribution of f{y) 
where y consists of independent standard Gaussian random variables (appropriately 
extending / to act on R^). To obtain more efficient version of these applications, we 
first show that the same holds even when x is a random vertex in our smaller subset of 
A/^-dimensional strings - the Reed-Muller codewords. Our central tool is a recent result 
by Meka and Zuckerman [MZIO] which derandomizes the invariance principle of Mos- 
sel et al. Our key insight is that taking a random Reed-Muller codeword can in fact be 
viewed as an instantiation of the Meka-Zuckerman generator, which involves splitting 
the input into blocks via a pairwise independent hash function, and using independent 
fc-wise independent distributions in each block. This allows us to obtain a version of 
the "Majority is Stablest" theorem for our graph, which is the main corollary of the in- 
variance principle that is used in applications of the longcode. See Section 5 for more 
details. 

Efficient alphabet reduction . With the "Majority of Stablest" theorem in hand, prov- 
ing Theorem 4 (efficient alphabet reduction for unique games), is fairly straightforward. 
The idea is to simply replace the noisy hypercube gadget used by [KKMO04] with our 
derandomized hypercube. This is essentially immediate in the case of alphabet reduc- 
tion to binary alphabet (i.e., reduction to Max Cut) but requires a bit more work when 
reducing to a larger alphabet. See Section 6 for more details. 

Efficient hierarchy integrality gaps. Our proof Theorem 3 again works by plugging 
in our short code / derandomized noisy hypercube in place of the long code in the pre- 
vious integrality gap constructions [KV05, KS09, RS09]. Specifically, these construc- 
tions worked by starting with an integrality gap for unique games where the basic SDP 
yields 1 - 1 /r, and then composing it with an alphabet reduction gadget to obtain a new 
instance; Raghavendra and Steurer [RS09] showed that the composed instances resist 
poly(r) rounds of the SA hierarchy and exp(poly(r)) rounds of the LH hierarchy. These 
constructions used the noisy cube twice — both to obtain the basic unique games gap in- 
stance, and to obtain the alphabet reduction gadget. We simply plug in our short code in 
both usages — using for the basic unique games instance the efficient version obtained in 
Theorem 2, and for the alphabet reduction gadget the efficient version obtained in The- 
orem 4. (Luckily, our unique games instance has affine constraints and so is compatible 
with our alphabet reduction gadget.) The result essentially follows in a blackbox way 
from the analysis of [RS09]. See Section 8 for details. 
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3 Preliminaries 



Let G be a regular graph with vertex set V. For a subset 5 c y we define the volume of 
5, denoted yu(S), to be l^l/IVI. We define the expansion of 5, denoted 0(S), to be the 
probabiUty over a random edge (m, u), conditioned on m e 5 that v ^ S . Equivalently 
(since G is regular), 0(5) = G{S, V\S)l{&e.gQ\S\) where deg^ is the degree of the graph 
G and G{S, V \ 5) is the number of edges going from S to V \ 5 . Throughout, we denote 
the normalized adjacency matrix of a graph G also by G, and refer to the spectrum of 
the adjacency matrix as the spectrum of the graph G. Note that by definition, every 
regular graph has maximum eigenvalue 1. In this paper, we use expectation norms for 
real-valued functions. That is, for a function /: 5 ^ R and p > 1, we let ||/||p := 

Many of the unique games instances that appear in this work belong to a special 
subclass of unique games, namely F2-Max-2Lin instances defined below. 

Definition 3.1. Given a group 9i, an *?/-Max-2Lin instance consists of a system of linear 
equations over the group 'H where each equation is of the form x; - xj - Cij for some 



Locally Testable Codes. Let C be an [A^, K, D]2 code, that is, C is a A'-dimensional 
Unear subspace of with minimum distance D (= min{wt(x) : x e C)). (In this 
paper, we are mostly interested in the extremely high rate regime when H = N - K 
is very small compared to A'^ and are happy with D being some large constant.) Let 
A(x, y) e [0,...,N} denote Hamming distance between x,y e F^. For a e and a 
code C we define 

def 

A{a,C) - minA(a, c). 

C€C 

Definition 3.2. We say a distribution T over F^ is a canonical tester for C if every 
vector in the support of the distribution 7" is a codeword q e C^. The query complexity 
of T is the maximum weight of a vector in its support. The tester's soundness curve 
St- N ^ [0, 1] is defined as 

=^ min P {<Qr,^) = 1} . 
A(a,C)>k 

Similarly, we denote the rejection probability of T for a vector a e F^ by sria) = 
IPq-r {{a, q) - 1}. We let the query probability r e [0, 1] of a tester be the expected 
fraction of queried coordinates, that is, r = E^^^t- wt((7)/A^. We say that a tester T with 
query probability r is smooth if for any coordinate / e [A'^], P^-r {qi - 1} - t and we say 
it is 2-smooth if in addition, for any two distinct coordinates / j, P^-r = qj = = 

If the tester T is clear from the context, we will sometimes drop the subscript of the 
soundness curve / rejection probability s<r- In the setting of this paper, we will consider 
testers with query probability slowly going to (with A^). Further, given a canonical 
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tester T", it is easy to amplify the probability of rejection by repeating the test and taking 
the XOR of the results. 

Finally, the following simple lemma gives some estimates for rejection probabilities 
of vectors for smooth testers. 

Lemma 3.3. If T is a smooth canonical tester with query probability t, then S'j-{a) < 
A{a, C) ■ T for every vector a € F^. Furthermore, if T is 2-smooth, then STia) > 
(1 - y) • A{a, C) ■ rfor every vector a e with A(a, C)t < y. 

Proof. Fix a € and let k = A(a,C). Without loss of generality, we may assume 
wt(a) = k. By renaming coordinates, we may assume ai = . . . = at = I and at+i = 
. . . - - 0. Then, s^ia) < P^~r {^i = 1) + ■ ■ ■ + ^q-r {<?yt = 1| - ^ • t. On the other 
hand, 

k 

sria) ^ y P {qi = 1) - y P [qi = qj =l]>kT- k\^ >il-y)-kT. □ 

We review the prerequisites for Majority is Stablest and Unique Games related re- 
sults in the corresponding sections. 



4 Small Set Expanders from Locally Testable Codes 

In this section we first use some known properties of hypercontractive norms to give a 
sufficient condition for graphs to be small set expanders. We then describe a generic 
way to construct graphs satisfying this condition from locally testable codes, proving 
Theorem 1. 



4.1 Subspace hypercontractivity and small set expansion 

Let be a subspace of the set of functions from V to R for some finite set V. We denote 
by P<v the projection operator to the space "V. For p,q> 1, we define 

\m,-.q max ^ . 

We now relate this notion to small set expansion. We first show that a subspace "V with 
bounded (4/3) 2 norm cannot contain the characteristic function of a small set: 

Lemma 4.1. Let f : V ^ {0, 1) such that = m^ev[fix)] then \\P^f\\l < W'VWl^^^^^l^^'^. 

Proof. This is by direct calculation 
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Note that if imi4/3^2 = 0(1) and/i = o(l), then = oiWfWj), meaning the 

projection of / onto V is small. It is often easier to work with the 2^4 norm instead 
of the 4/3 2 norm. The following lemma allows us to use a bound on the former to 
bound the latter: 

Lemma 4.2. 

imi4/3^2 < imi2^4 
Proof. Let / : y ^ R and let /' - Pyf. We know that 

E[/'2] = E[/' • /] (since /' is the projection of /) 

< ]E[/'4]i/4 ]E[//3]3/4 (by Holder's inequahty) 

= E[(P<v/')'^] ^^"^ F.if'^f'^ (projection is idempotent) 

<imi2^4E[(/'f]l/2E[/4/3]3/4. 

Dividing by Wfh = yields the resuh. n 

We now conclude that graphs for which the top eigenspace has bounded 2^4 norm 
are small set expanders. The lemma can be viewed qualitatively as a generalization of 
one direction of the classical Cheeger's inequality relating combinatorial expansion to 
eigenvalue gap [Che70]. 

Lemma 4.3. Let G = {V, E) be regular graph, and "V be the span of the eigenvectors of 
G with eigenvalue larger than A. Then, for every S QV, 

o(5)^ i-i-imiL4^^ 

Proof. Let / be the characteristic function of S , and write f - f + f" where /' = P-y/ 
(and so /" - f - f is the projection to the eigenvectors with value at most X). Let 
yu = yiiS). We know that 

cl)(5) - 1 - </,G/>/||/|i = 1 - {f,Gf)lpi (4.1) 

By Lemma 4.1, and Lemma 4.2, 

</,G/> - {f',Gf')+{f",Gf") < < \YV\%^^^n^l^+An < imiL4A''^'+^)"- 

Plugging this into (4.1) yields the result. □ 

4.2 Cayley graphs on codes 

Motivated by the previous section, we now construct a graph for which the projection 
operator on to the top eigenspace is hypercontractive, i.e., has small 2 — > 4 norm, while 
also having high rank. 

Let C c be an [A'^, K, D]2 code. The graph we construct will be a Cayley graph 
with vertices indexed by and edges drawn according to a canonical local tester T 
for C. Let Cay(C"'",T') denote the (weighted) Cayley graph with vertex set C""" and 
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edges generated by T. We describe the graph more precisely by specifying the neighbor 
distribution for a random walk on the graph. For a vertex p € C""", a random neighbor has 
the form p + q with q sampled from the tester 7". (Since the group has characteristic 
2, the graph Cay(C"'", 7") is symmetric for every tester T.) 

We will argue that if the tester T has small query complexity and good soundness, 
then the graph Cay(C"'", T") has many large eigenvalues while being a small-set expander. 

Theorem 4.4. Let C be an [N,K,D]2 linear code that has a canonical tester T with 
query complexity sN and soundness curve sQ cind let k < D/5. The graph Cay(C"'",T') 
has 2^"^ = 2^ vertices with at least N/2 eigenvalues larger than 1 - 4s. All subsets S 
ofC^ have expansion at least 

0(5) > 2s{k)-?>'' ^li^) 

By Xoring the results of mulitple tests, one can let the soundness s{k) tend to 1/2- 
Hence, if s{k) is significantly larger than e (for appropriate k), one can obtain a graph 
with many large eigenvalues such that small enough sets have near-perfect expansion. 

Eigenfunctions and Eigenvalues. We identify the graph G - Cay(C"'",7') by its nor- 
mahzed adjacency matrix. For every vector a e F^, the character : C""" — > {±1) with 
Xa{.p) = (-1)^"' ''^ is an eigenfunction of G. If two vectors a,/3 € belong to the same 
coset of C, they define the same character over since {a + p,p) = for all p e C^, 
while if a + P ^ C then (Xa^Xii) - 0- Thus, the set of characters of corresponds 
canonically to the quotient space F^ /C. If we fix a single representative a for every 
coset in F^ /C, we have exactly 2^~^ = 2^ distinct, mutually orthogonal characters. We 
define the degree of a character as follows: 

degixa) ^ min wt(a + c) ^ A{a, C) . (4.2) 

csC 

Note that if deg(;^a) < D/2, then the minimum weight representative in a 4- C is unique. 
(This uniqueness will allow us later to define low-degree influences of functions, see 
Section 5.) 

We let Aa denote the eigenvalue corresponding to character The following ob- 
servation connects the soundness of the canonical tester to the spectrum of G: 

Lemma 4.5. For any a e F^, Aa - I - 2s{a). 

Proof. From standard facts about Cayley graphs, it follows that 

^ - E [xaiq)] - E [(-1)"-^^] = l- 2P[«-g-l]-l - 2s{a). (4.3) 

qeT qeT qeT 

□ 

We use this to show that many dictator cuts in G which correspond to characters 
with degree 1 have eigenvalues close to 1. We let Ai,Xi denote A[ii,X{i]- As noted before, 
for D > 2 these are distinct characters. 

Corollary 4.6. We have Ai> \- As for at least NI2 coordinates [i] e N. 
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Proof. We have /l; = 1 - 2 Vq^rVqi = U- Since wt(^) < sN for every q eT, 

N 

y P [qi = \]< sN. 

So we can have Pqerlqi = I] 2e for at most N/2 coordinates. □ 

Another immediate conseuqence of Lemma 4.5 is that large degree characters have 
small eigenvalues. 

Corollary 4.7. Ifdegixa) > k then i„ < 1 - ls{k). 

Subspace Hypercontractivity. Given a function f:C^ — > R we can write it 
(uniquely) as a linear combination of the characters {Xa]ae¥':{ ic 

f(p)= 2] fia)Xaip), 

where f{a) = (xa, f) is the Fourier transform of / (over the abelian group C^). 

We define the degree of /, denoted deg(/) to be max^.^^^^^Q deg(;^^Q,). Note that 
deg(/ + g) ^ max{deg(/), d&g{g)} and deg(/g) < deg(/) + deg(^). The following 
crucial observation follows immediately from the fact that C has minimum distance D. 

Fact 4.8. The uniform distribution on is (D - 1) wise independent. That is, for any 
a € such that 1 < wt(Q') < Dwe have Epgc± Ufo-C/')] - 0- 

This fact has the following corollary: 

Lemma 4.9. Let i < (D - l)/4 and let 'V be the subspace of functions with degree at 
most i. Then Wi^a < 3^/^ 

Proof. The proof follows from the following two facts: 

1. This bound on the 2^4 norm is known to hold for true low degree polynomials 
under the uniform distribution on the hypercube by the Bonami-Beckner-Gross 
inequality [O'D08]. 

2. The expectation of polynomials of degree up to 4/' < D - 1 are the same under the 
uniform distribution and a D - 1-wise independent distribution. 

Given / : R" ^ R, let f'^ denote its projection onto the space spanned by 
characters where Ae.g(xa) ^ ^- We have 

Ii/ii4= E r/V]= E r/V], 

peC^ pe{0,l|" 

> \\f% = E [f\pf^ = E [/(p)2] . 

peC-L pe{0,l)" 
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By the 2^4 hypercontractivity for degree € polynomials over {0, 1)'', 

E [/V)']<9^ E Vfipff- 

So we conclude that 

peC^ peC^ peC^ 

which imphes that ||^||2^4 < 3^'^. □ 

Combining the above bound with Lemma 4.3 we get that, if the local tester rejects 
sufficiently far codewords with high probability, then the resulting graph is a small set 
expander: 

Corollary 4.10. For every vertex subset S in the graph C&y{C^, T) and every k < D/5, 
we have 

0(5) > 2sik)-3''n{S)^. 

In particular, as s{k) tends to 1/2, the expansion of small sets tends to 1. This 
corollary together with Corollary 4.6 completes the proof of Theorem 4.4. 



4.3 A Canonical Tester for Reed MuUer codes 

We instantiate the construction from the previous section for the Reed Muller code. Let 
C = RM(?i, « - J - 1) be the Reed Muller code on n variables of degree n - d- I, which 
has N = T^,H = Y^j^d ("•) ^^'^ ^ - 2''*^ Bhattacharyya, Kopparty, Schoenebeck, Sudan 
and Zuckerman [BKS^IO] analyze the canonical tester Trm which samples a random 
minimum weight codeword from C""". It is well known that the dual of RM(n, n - d - I) 
is exactly RM(«, <i) and that the minimum weight codewords mRM{n,d) are products of 
d linearly independent affine forms. They have weig ht 2"-'' = sN where 8 = 2-"^. Thus, 
our graph Cayp^yi = Cay(RM„_j,TRM) has as its vertices the d-degree polynomials over 
F2 with an edge between every pair of polynomials P, Q such that P - Q is equal to a 
minimum weight codeword, which are known to be products of d linearly independent 
affine forms. 

Theorem 4.11 ([BKS^IO]). There exists a constant 770 > such that for all n,d, and 
k < r]o2'^ the tester Trm described above has soundness s(k) > (k/2) ■ 2""^'. 

Theorem 4.1 1 allows us to estimate the eigenvalue profile of Cay^iy and shows that 
small sets have expansion close to 0(770)- From here, we can get near perfect expansion 
by taking short random walks. To avoid cumbersome discretization issues we work with 
continuous time random walks on graphs instead of the usual discrete random walks. 

Definition 4.12. For a graph G the continuous-time random walk on G with parameter 
t is described by the (stochastic) matrix G(t) = e~"-'~^\ G{t) and G have the same 
eigenvectors and the eigenvalues of G{t) are {e"'*^'"'^'^), where {//,•) is the spectrum of G. 
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We will view Cayp|y(f) as a weighted graph. We show that its eigenvalue profile is 
close to that of the noisy cube, this stronger statement will be useful later. 

Lemma 4.13. Let t = sl^^^ for e > and p - e~^. Let |/1q,) denote the eigenvalues of 
CayRM(0- 

- IfAegixa) - k, Aa max(p*^^^,p'^''^ ) where fiQ is an absolute constant. 

- For all 6 < d^for some constant 5(), if AtgiXa) = k < 6^2'^^^, \Aa — < 6. 

Proof. Let {fia} be the eigenvalues of Caypiy corresponding to the character Xa so that 
Aa = e''^^~'^"\ Let t - 2'^'^'*'^\ Since the canonical tester Trm for C is 2-smooth, by 
Lemma 3.3, fia ^ I - kr ± k^r^. Hence, Aa - g-'d"^") = ^"'^^(1*'^^) = p''e-'''''\ 

For ^ < 2"^, 1 - jUa = kr ± k^r^ > kr/l. Therefore, if degij^a) ^ 2''> = 
^-e2'*Ui-M.) ^ ^-s2-'*'kri2 ^ ^i/2_ p^j. ^ > 2^, by Corollary 4.7, < 1 - 2s{k) < Co 
for a universal constant Co < 1. Therefore, \Aa\ < e"'^^ (i-Co) = p(i-Co)2 p/Jo2 f^j. 
/lo < (1 - Co)/2. 

We now prove the second bound. If ek'^T < 5/10, we have Aa - p~*^(l + 5) which 
implies \Aa - pH < 6. Otherwise, if sk^T > 5/10, our assumption k < 6^T^^^ implies 
sk > 1/(10(5), hence e"^*^ < g-ro* < 6/4 for all 6 < 6q. For it < 2^, (1 - - 
A:t ± k^T^ > kTjl. Hence Ac, < e"'*^^''^ ^ ^-jos ^ for all 5 < 5(). In this case, we get 
l^«-p*l<l^l + lp*^l<5/2. 

□ 

Since the eigenvectors stay the same, Cayp|y|(0 inherits the hypercontractive proper- 
ties of Caypiy. In particular, by Lemma 4.9, ||'V||2-^4 < 3^^^ where denotes polynomi- 
als of degree I < Combining Lemmas 4.3 and 4.13, we obtain a graph with small 
set expansion and many large eigenvalues. 

i 

Theorem 4.14. For any e, ?/ > 0, there exists a graph G with 2(^°§l'^l)'' eigenvalues 
larger than 1 - sfor d - log(l/e) -I- log log(l /tj) + 0(1) and where every set S QG has 
expansion 

$(5)^ l-?7-3^'°g(i/'')^^ 

for some constant c\. 

Proof. Let po, 5q be constants from previous lemma. Fix ( - ^ log(-) so that e"'^^^^ - rj 
and d = log(^) -i- C2 so that £ < min(//o2'^'^^ 2'^/5). Consider the graph Cayp|y|(0 of the 
continuous random walk on Caypiyi where t = e2^^' as in Lemma 4.13. Note that the 
graph has \G\ = Yjj^d{'^) vetices. Let [pct) be the spectrum of Caypiy and A^ be the 
spectrum of CayR|y|(f). 

Then, for every a € F^/C, deg(Yff) = 1, we have ^^(a) ^ 2"'^. Hence ;Uq,> 1-2"'^^\ 
Aa > e~'^ ' = e~'^^. Therefore, there are at least N - 2^^°a\G\)^^'' eigenvalues which are 
larger than 1 - As. 

Since i < pq2'^^^ , by Lemma 4.13 if deg(xa) > £, Aa <= rj. Let "V be the subspace 
spanned by characters of degree at most £. Since £ < 2^/5 by Lemma 4.9, 11*1^112^4 ^ 
3^^^. Therefore, by Lemma 4.3, for any set 5 c G with p{S) < 6, 

(D(S)> l-77-3^'°s(^/'') y[JI(S). 
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□ 



Remark 4.15 (Coding application). The fact that our graph is a Cayley graph over F'^ 
has a potentially interesting implication for coding theory. By looking at the set of edge 
labels as the rows of a generating matrix for a code, we know that large Fourier coef- 
ficients corresponds to low weight codewords, and hence we get a code of dimension 
m = that has an almost exponential (i.e. 2") number of codewords of low weight, but 
yet has small generalized Hamming distance in the sense that every subspace of codi- 
mension a»(l) contains a codeword of fractional Hamming weight 1 - o(l). In particular 
by setting to be a function slowly tending to infinity we can get a linear code for which 
correcting from an o(l) fraction of corruption errors requires an almost exponential list 
size, but for which one can correct a fraction approaching 1 of erasure errors using a 
list of constant size. (The code obtained by taking all edges of our graph has an almost 
exponential blowup, but this can be reduced by subsampling the edges.) 



5 Majority is Stablest over Codes 

In this section we show an analogue of the "Majority is Stablest" result of Mossel et 
al. for the RM graph we constructed in the previous section; this will help us replace the 
noisy cube with the RM graph in various unique games gadgets. 

We first review some definitions. For a function / : {±1}^ R and £ > 0, define 

Inff (/) = 1/(^)1"- 

ae{0,l)~,|aK^,a, = l 

For p > 0, let Fp : [0, 1] [0, 1] be the Gaussian noise stability curve defined as 
follows. For € [0, 1], let f € R be such that Pt,^yv(o,i[0 < t] = fi. Then, Fp(//) = 
^x,¥[X <: t,Y ^ t], where {X, Y) € R^ is a two-dimensional mean zero Gaussian random 

vector with covariance matrix ( ^ ^ |. We refer the reader to Appendix B in Mossel 

et al. [MOO05] for a more detailed discussion on Fp. 

Let P(x) = 'ZjIc[N] Wiei^i be a A/^-variate multilinear polynomial P : R^ R. 
Define = YjI '^j and we say P is e-regular if for every / € [N], aj ^ ■ \\P\\^- 

Throught this section, we let Tp at (we omit N when the dimension is clear) denote 
the noisy hypercube graph with second largest eigenvalue p. 



5.1 Majority is stablest and Invariance 

The following theorem shows that, in the context of noise stability, a regular function 
on the hypercube behaves like a function on Gaussian space. 

Theorem 5.1 (Majority is stablest, [MOO05]). Letf: {±1]'^ [0, 1] be afunction with 
E/ = fi. Suppose Inff^^'^^^'^^V/) < t for all i e [A^]. Then, 

\J,ipJ) ^ ip^/^; + (i--p)iog(i/T) ' 

where Tp is the Boolean noise graph with second largest eigenvalue p and Fp is the 
Gaussian noise stability curve. 
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We will need the following ingredient of the proof of Theorem 5. 1 from [MOO05]. 
For a, ^ e R, let ([a,b] '■ ^ — ^ 1R+ be the functional ^[a,b]{x) - max|a - x,x -b,Of. For 
a real-valued random variable X, the expectation E^(X) is the Lj-distance of X to the 
set of [a, ft]-valued random variables (over the same probability space as X). We will be 
interested in the case a = and b = I. For this case, we abbreviate ^ = ^[o,i]- 

Theorem 5.2 (Invariance Principle, [MOO05, Theorem 3.19]). Let P be an T-regular 
N-variate real multilinear polynomial with degree at most t and < 1. Then, 



E ^oP(x)- E ^oP(i/) 

jce{±l)" y~A^(0,l)"| 



We will need the following corollary of that can handle functions that are not [0, 1]- 
valued as in the theorem but just close to [0, l]-valued functions. 

Corollary 5.3. Let f : {±\\^ ^"Kbe a function with E / = /i and E ^ o / < r. Suppose 
jjjf .y=?30iog(i/r) ^ j foy all i € [A^]. Then, 

\J,ipJ) ^ ipW + (l-p)log(l/r) ' 

where Tp is the Boolean noise graph with second largest eigenvalue p and Fp is the 
Gaussian noise stability curve. (Here, we assume that t is small enough.) 

Proof. Let /' be the closest [0, l]-valued function to /. Since \\f - f'\\ < ^Jt, it 
follows that inf^20iog(i/r)^, ^ j + o{^J^) « T^l^ and E/' < E/ + V^^- Since 
{f,Tpf) < {f',Tpf') + 0{^), the corollary follows by applying Theorem 5.1 to the 
function /'. (Here, we also use that fact that Fp(yu + ^JT) < Fp(yu) + 2 Vr. See Lemma 
B.3in [MOO05].) □ 

We remark that although we specialize to Reed-Muller codes in this section, most 
of the arguments generalize appropriately to arbitrary codes with good canonical testers 
modulo a conjecture about bounded independence distributions fooling low-degree poly- 
nomial threshold functions. We briefly discuss this in Section 5.3. 

To state our version of "Majority is Stablest" we first extend the notion of influences 
to functions over Reed-Muller codes. For <i € N, A/^ = 2", let C £ F^ be the Reed- 
Muller code RM(?i, « - J - 1) and let c be its dual RM(?i, d). For the rest of this 
section we assume that a set of representatives corresponding to the minimum weight 
codeword in each coset is chosen for the coset space F^/C. 

Definition 5.4. For a function / : C""" — > R and / € [N], { > 0, the ^-degree influence of 
coordinate / in / is defined by 

Inff(/)= Yj f^""^^- 

\a\^i,ai=l 
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(Recall that the Fourier coefficient f{a) = ^xec^lXai^)]-) As all a's with weight 
less than half the distance of C fall into different cosets of C, for { < D/2, the above 
expression simplifies to 

Inf f '(/) - J] haf . 

The sum of ^-degree influences of a function / can be bounded as below. 
Lemma 5.5. For a function / : C""" — > R and € < D/2 

2 Inff (/) < ^V[/] 
ie[N] 

where V[f] = E[/^] - (E[/])^ denotes the variance of f. 

Proof. The lemma is an easy consequence from the definition of Inf ^^(/) and the fact 
that V[/] - Yjai^o /(o')^- We include the proof for the sake of completeness. 

< { 2 f\af < 



We are now ready to state the main result of this section generalizing the Majority 
is Stablest result to Reed-Muller codes. Let Trm be the canonical tester for C as defined 
in Section 4.3. 

Theorem 5.6. There exist universal constants c, C such that the following holds. Let 
G be a continuos-time random walk on the RM graph Cay(C"'",TRM) with parame- 
ter t - sl'^^^. Let /: C""" — > [0, 1] be a function on with ^x~c^Vf{x)\ - fi and 
maxfe[w] Inff "'"^^^^^^Z) < r. Then, for d> Clog(l/T), 



cloglog(l/T) 
(l-p)log(l/T)' 



(/,G/)<r»+ ^. (5.1) 



where p = e ^ and Fp : R ^ R the noise stability curve of Gaussian space. 

The proof of the theorem proceeds in three steps. We first show that the eigenvalue 
profile of the graph G is close to the eigenvalue profile of the Boolean noise graph 
(see Lemma 4.13). We then show an invariance principle for low-degree polynomials 
(and as a corollary for smoothed functions) showing that they have similar behaviour 
under the uniform distribution over the hypercube and the uniform distribution over the 
appropriate Reed-Muller code. Finally, we use the invariance principle to translate the 
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majority is stablest result in the hypercube setting to the Reed-MuIIer code. The above 
approach is similar to that of Mossel et al., who translate a majority is stablest result in 
the Gaussian space to the hypercube using a similar invariance principle. 

We first state the invariance principle that we use below (see the next subsection for 
the proof). Recall the definition of the functional ^: R — > R from Section 5.1. 

Theorem 5.7. Let N = 2" and d > 41og(l/T). Let P : ^ R be a T-regular 
polynomial of degree at most t. Then, for x €„ |+1)^, z RM(?i, d), 

for a universal constant ci > 0. 

The (somewhat technical) proof of Theorem 5.6 from the above invariance princi- 
ple closely follows the argument of Mossel et al. and is defered to the appendix - see 
Section A. 

5.2 Invariance Principles over Reed-Muller Codes 

The various invariance principles of Mossel et al. [MOO05] are essentially equivalent 
(upto some polynomial loss in error estimates) to saying that for any low-degree regu- 
lar polynomial P, the polynomial threshold function (PTF) sign(P( )) cannot distinguish 
between the uniform distribution over the hybercube and the standard multivariate Gaus- 
sian distribution N{0, 1)^. 

Theorem 5.8. Let P : R^ R be a s-regular polynomial of degree at most t. Then, 
for any x ~ {±1}^ y ^ N{0, if, 

|E[sign(P(x))] -E[sign(P(i/))]| < 0{{s"^^^*'^). 

Ideally, we would like a similar invariance principle to hold even when x is chosen 
uniformly from the codes of the earlier sections instead of being uniform over the hy- 
percube. Such an invariance principle will allow us to analyze alphabet reductions and 
integrality gaps based on graphs considered in earlier sections (e.g., the RM graph). We 
obtain such generic invariance principles applicable to all codes modulo certain plausi- 
ble conjectures on low-degree polynomials being fooled by bounded independence. 

For the explicit example of Reed-Muller code we bypass the conjectures and directly 
show an invariance principle by proving that the uniform distribution over the Reed- 
Muller code fools low-degree PTFs. To do so, we will use the specific structure of the 
Reed-Muller code along with the pseudorandom generator (PRG) for PTFs of Meka 
and Zuckerman [MZIO]. Specifically, we show that the uniform distribution over RM 
can be seen as an instantiation of the PRG of [MZIO] and then use the latter 's analysis 
as a blackbox. Call a smooth function (A : R ^ R B-nice if \if/'-'^\t)\ < B for every f e R. 

Theorem 5.9. LetN ^ T andd > log ^-i-21og(l/£)4-2. LetP \ R!^ ^Rbe a s-regular 
multi-linear polynomial of degree at most £. Let x <— N{0, \)^ , z ~ RM(«, d). Then, for 
every l-nice function i/' : R ^ R 
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To prove the theorem, we first discuss the PRG construction of [MZIO]. Let t = l/e^ 
and M = N/t. Let : [A'^] — > [t] be a family of almost pairwise independent hash 
functions'^ and let D = be a (4f) -wise independent distribution over {+1)'". The 
PRG of [MZIO], G'H^j), can now be defined by the following algorithm: 

1. Choose a random h e 'H and partition [N] into t blocks B\,. . . ,Bt, with Bj - {i : 
h{i) = j}. 

2. Choose independent samples xi, . . . ,Xt <— D and let i/ € {+1}^ be chosen accord- 
ing to an arbitrary distribution independent of xi, . . . , x,. 

3. Output'" 

z € {+1}^, with z'i ^ Zi ■ Mi for i e [N] , where z\Bj ^ xj for j e [?]. (5.2) 

Meka and Zuckerman show that G<^ ;d as above fool (arbitrary) low-degree polynomials. 
Below we state their result for regular PTFs which suffices for our purposes and gives 
better quantitative bounds. 

Theorem 5.10. [Lemma 5.10 in [MZ09]] Let P : R'^ "Rbe a e-regular multilin- 
ear polynomial of degree at most €. Then, for x e„ {±1)'^, and y e {±1}^^ generated 
according to G<]-(^£), 

|E[(A(P(x))] - E[(A(P(i/))]| < ^f^e" . 

We next show that the uniform distribution over RM(«, <i) for a sufficiently high d 
is equivalent to G-^^) above, for an appropriately chosen hash family and (4/')- 
wise independent distribution D. Below we identify \N] with and [f] with for 
c = 21og(l/e). 

Proof of Theorem 5.9. For simplicity, in the following discussion we view RM(?i, <i) as 
generating a vector in F^ and show that the uniform distribution over PM{n,d) has 
the appropriate independence structure as required by Theorem 5.10, albeit with {±1} 
replaced with {0, 1). This does not the effect the analysis of the generator. 
Let c = 2 log(l/e) and let S be the subspace of polynomials of the form 

Ql(Xi,...,X„) ^ ^ ^{x\lc]=a)■Pa{Xc+l,...,Xn), 

where the polynomials Pa each have degree at most d - c. Note that we can sample a 
uniformly random element Q\ € .S by choosing independent, uniformly random degree 
at most d - c polynomials Pa : Fj"'^ — > F2 for a € |0, l)'^ and setting Q\ as above. This 
is because, each collection {Pa)ae{Q,\Y leads to a unique element of S and together they 
cover all elements of S. 

'A hash family 'H is almost pairwise independent if for every i j e [N], a,b e [t], P/,i=„'/f [/i(0 = 
a A h(j) = &]<(!+ a)/t- for a = 0(1). 

'"The description we give here is slightly different from that of [MZIO] due to the presence of the string 
y. However, the analysis of [MZIO] works without any changes for this case as well. 
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Let S' be a subspace of degree d, «-variate polynomials such that S n S' = {0} and 
S, S' together span all degree d polynomials. Let ^ : F2 — > be the space of all affine 
transformations. For A e Jl, let '■ \N] — > [f] be defined by /za(^) = and let 

"7/ = {Ha '■ A € It is easy to see that for A €„ Jl, the hash functions Ha are almost 
pairwise independent. Observe that for Q\ e„ 5, Q2 €„ S' and A e„ Jl, the polynomial 
2( ) - (Qi + QiXM )) is uniformly distributed over all «-variate degree d polynomials. 

Now, fix a polynomial Q2 € S'. Then, for a random Qi €„ S, we have 

= ^ 1 (/JA W = a) ■ Pa{Ua+l, ...,Un) + Qliu), 

aeiO.l]': 

where u = Ax and the polynomials iPa)ae{o,i]':, are independent uniformly random poly- 
nomials of degree at most d - c in n - c variables. Let D denote the distribution of 
(/"(M))„g]pn-c for P' a uniformly random polynomial of degree at most d - c in (n - c) 
variables. Then, for every fixed A e ^ and Q2 € S', the distribution of the evaluations 
of Q restricted to different buckets Ba = {x : hA{x) - a] are independent of one another. 
Moreover, within each bucket Ba, the evaluations vector {Q\{x))xeBa is distributed as D, 
which is (2^"'' - 1) -wise independent. 

Therefore, for every fixed Q2 € S' , the distribution of z = {Q{x))xewi is the same as 
the output of Gi-ix) as defined in Equation 5.2, where y - Q2{A{x)). The theorem now 
follows from Theorem 5. 10. □ 

The invariance principle of Theorem 5.9 combined with the appropriate choice of 
the smooth function ijj gives us the following corollaries. 

Proof of Theorem 5.7. Follows from using Theorem 5.9 and an argument as in Theo- 
rem 3.19 of [MOO05] who get a similar conclusion for the hypercube starting from an 
invariance principle for the hypercube to the Gaussian space. □ 

Corollary 5.11. Let N = T and d ^ log £ + 21og(l/e) + 2. Let P : ^ be a 

E-regular polynomial of degree at most i. Then, for x {±1}^, z RM(m, d), 

|E[sign(P(x))] -E[sign(P(z))]| < OC^e^^^^f+i)) 

Proof. Follows from Theorem 5.9 and Lemma 5.8 in [MZ 10]. □ 

Finally a similar argument in the proof Theorem 5.9, using a minor modification 
of the full analysis of the PRG from [MZIO] (Theorem 5.17), shows that Reed-Muller 
codes with d = n(^log(l/e)) fool all degree £ PTEs. We exclude the proof in this work 
as we do not need the more general statement in our applications 

Theorem 5.12. There exists a constant C > such that the following holds. Let N = 2" 
and d = Ci log( 1 / s). Let P : -^^be a multilinear polynomial of degree at most t. 
Then, for x^u {±1}^, z €„ RM(?i,(i), 

|E[sign(P(x))] - E[sign(P(z))]| < e. 
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5.3 Invariance Principles over Codes 



Our main tool for proving that "majority is stablest" result over Reed-Muller codes, 
Theorem 5.6, was the invariance principle Theorem 5.7. We conjecture that similar re- 
sults should hold for any linear code with sufficiently large dual distance so that the 
codewords have bounded independence. In particular, we conjecture that bounded inde- 
pendence fools arbitrary low-degree polynomial threshold functions (PTFs) over {±1}". 

The conjecture is known to be true for halfspaces [DGJ"''09], degree two PTFs 
[DKNIO] and for Gaussians with bounded independence [Kanl 1]. 

Conjecture 5.13. For all d e 'N and e > 0, there exists k = k{d,E) such that the 
following holds: Let Q be an n-variate multilinear real polynomial with degree d. Let X 
be an k-wise independent distribution over {±1}" and let Y be the uniform distribution 
over {±1}". Then, |Esigno2(X) - Esignoe(F)| < s. 

Finally, we remark that for the application to "majority is stablest" it suffices to show 
a weaker invariance principle applicable to the ^ functional. 

Conjecture 5.14. For alld efi and s > 0, there exists k - k{d, e) and rj - j]{e) such that 
the following holds: Let Q be an n-variate multilinear real polynomial with degree d. Let 
X be a k-wise independent distribution over { + 1}" and let Y be the uniform distribution 
over {±1}". Suppose that E Q{Xf < 1 a«<i E^ o Q(X) < rj. Then, E^ o Q{Y) < e. 

We show in the appendix that Conjecture 5.13 implies Conjecture 5.14. 

Lemma 5.15. Let X be a 20{-wise independent distribution over {±1}^ that E-fools 
every r-regular degree-£ PTF. Then, for every T-regular N-variate multilinear real poly- 
nomial Q with degree at most £ and E Q{X) < 1, we have for the uniform distribution Y 
over{±\f, 

E^ o Q{Y) < E^ o Q{X) + 2^<^^£°'^ . 



6 Efficient Alphabet Reduction 

The long code over a (non-binary) alphabet Q consists of the set of dictator functions 
{/i, ■ ■ ■ ,/v : ^ ei, where fix) - xt for all x € 2^. 

A natural 2-query test for this code was proposed by Khot et al. [KKMO07] and 
analyzed in Mossel et al. [MOO05]. The queries of the test are associated with the 
edges of the E-noise graph on . In this graph, the weight of an edge {x,y) is its 
probability in the following sampling procedure: Sample x € uniformly at random 
and resample each coordinate of x € independently with probability e to generate 

In this section, we present a more efficient code that serves as an analogue for the 
long code over a non-binary alphabet. For «, <i € N, let N = 2" and let C Q F^ be ffie 
Reed-Muller code ¥M{n,n - d - \) and let D ^ C"^ € F^ be its dual RM(?i,<i). Let 
T QD denote the canonical test set for the code C as in Section 4.3. 

Let f e N and let Q = V^. We define the following distribution 7", over D' (the ?-fold 
direct sum of £), a subspace of F2^), 
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- Sample c from the test set 7" c D. 

- Sample w - . . . , w^'^) from F2 at random. 

- Sample z = {z'-^K z'-'^) e D' by setting 

^ |o if = . 

Consider the continuous-time random walk on the graph Cay(D', 7",) with parameter 
£ • 2'^ (starting in point € D'). Let Te^t be the distribution over D' corresponding to 
this random walk. The Cayley graph Cay(D' ,7's t) will serve us as an analogue of the 
£-noise graph on Q^. 



Spectrum. In the following we will demonstrate that (part of) the spectrum of the 
Cayley graph Cay(£)^ T^^^e) corresponds to the spectrum of the e-noise graph on Q^. To 
this end, we recall the spectrum of the e-noise graph on Q^. First, we define a convenient 
basis for the functions on 2 = We will denote the coordinates of a vector a e Q = ¥'2 
by a = (a^'\ . ■ ■ , a^'^)- The set of characters of F'^ is Ixa : Fj ^ {±1} | a e F2), where 

Since the noise graph on is a Cayley graph over the abelian group F'^, the 
characters of this group form a basis of eigenfunctions. For fi = (Jii, . . . e Q^, let 
Xfi- — ^ l±l) denote the character 

Xp{x\,...,XN) ^ Y[xp,{xi). 

The eigenvalue of xp in the e-noise graph on 2^ (1 - e)™"^) where wt(;S) - |{/ 1 yS,- + 0')| 
is the Hamming weight of yS as a \&ngth-N string over alphabet Q. (In this section wt(-) 
will always refer to the Hamming weight of strings over alphabet Q.) 

The canonical eigenfunctions of Cay(2)',T'f) and Cay(D',T",_£) are indexed by j6 € 
IC . (Note that C' is the orthogonal complement of T)'.) Analogous to the definition 
in Section 4.2, we define the degree of a character : ^ {±1} for y6 e IC' as, 

degCf/j) = ^iifi) = min wt(/3') , 

where wt^S') = \{i e [A'^] \ t 0') I is the Hamming weight of yS' seen as a length- A/^ 
string over alphabet Q. (Here, the minimum is over all /3' e that lie in the same coset 
as jS in Q^/C.) 

The following lemma is an analogue of Lemmas 3.3 and 4.13 and shows that the 
eigenvalues of Cay(D', 7",) are similar to the eigenvalues of the e-noise graph. 

Lemma 6.1. Let (5 € jC'. The eigenvalue Ap of the character xp in the graph 
Cay(£)^ T,) satisfies Ap = l- wt(j6)/2^' ± 0{w\{fi)l2'^f and Afj^l- 0(1/0 • min{wt(j6) • 
2-^,1). 
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Proof. We will first prove an upper bound on Ajj for the case that wt(/3) » 2'^ . We write 
P = . . . with yS^') e F^. Let z = (z^^K z^'^) e&bea. string drawn from the 
distribution Tf. Note that z^'^ = w^'^ ■ c, where w = (w^^\ . . . , w^'^) and c are sampled as 
in the definition of 7",. Since w is a random vector in V^^, we can upper bound /lyj, 

Ap = E(-l)^''> 

= 1-2 P IZLi w®(/3®,c> - l) 

- 1 - P \3i.{p^'\c} = l] 
ceT ' ' 

< 1 - max P |(/3®,c> = l) . 

ie[t] ceT ' ' 

Without loss of generality, we may assume that ^^'^ has Hamming weight (as a binary 
string) at least \N{(Ji)/t. By Theorem 4.11, if \n{(JS) > 7/2"'' for sufficiently small rj > 0, 
we can upper bound A/j < I - 0.{r]/t). 

Next, we will estimate A/j (from below and above) for wt(jS) «c 2~^. Let / c [A^] be 
the set of coordinates / e [A'^] with /3, i= 0'. We claim, 

^« - 1 - P 11/ n supp(c)| = 1} ± 0(1) • P j|/ n supp(c)| > 2} . 

c c 

We write = (fii,... ,Pn) with fit e F^. Then, </3,z) = ^jiem ^iiw^Pi)- We refine the 
event z) = 1 according to the cardinality of / n supp(c). If / n supp(c) = 0, then 
(yS,z) = 0. On the other hand, conditioned on |/ n supp(c)|, the event (/?, z) = 1 is 
equivalent to the event {w,Pig} = 1 with |/ol = 10 supp(c). Since y6,„ 0^ this event has 
(conditional) probability i/2- Hence, 

P {<y6, z> = l) - ^ ^P_ {|/ n supp(c)| - l) ± P_ {|/ n supp(c)| > 2) , 

which implies the claimed estimate for A/j. 

It remains to estimate the distribution of |/ n supp(c)|. The argument is similar to the 
proof of Lemma 3.3. For every coordinate / e [N], we have Peer {ci = 1} = 2~^. Thus, 
P 1 1/ n supp(c)| ^1) < |/| • 2"^ = wt(/3)/2''. On the other hand, for any two distinct 
coordinates i i= j € [N], we have Peer {c; - = l} = 2"^'^'. Therefore, 

P { 1/ n supp(c)| - 1) ^ J] p {c,- = 1} - J] P {a = cj = \]> wtOS)/2^ - (wtOS)/2^)2. 

iel i< jel 

Similarly, P 1 1/ n supp(c)| > 2} ^ {w{(J3)/2''f. We conclude that 

A/j = l- w{(J3)/2'^ ± 0{\N{(J3)/2'^f 

(Note that the estimate is only meaningful when wt(yS) «: 2'^.) □ 

If the character ;t'y3 has eigenvalue A^ in the graph Cay(D^7",), then it has eigenvalue 
g-£(\-Ap)/2'' Cay(D', T^e). Similarly to Lemma 4.13, the eigenvalue of a character ;t/3 
is close to e"^*'^^ in the graph Cay(D^ Tt^e)- 
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Lemma 6.2. - If wt(yS) < 6^2'^ for sufficiently small 5, then the character xp has 
eigenvalue e''' ™'*^) ± 6 in the graph Cay (2)', Tt^e)- 

- For an absolute constant cq and all /3 e Q^/iy, < mdjL{p^^'>l'=^* ,p^'' 1'=^'). 

Influences. Let p e Q^/C. Suppose wt06) < wt(C0/2. (Note that C' c has the 
same minimum distance as C c F^. ) In this case, we will identify /? with the (unique) 
codeword of minimum weight in the equivalence class p e jC' . 

Definition 6.3. For a function /: — > R, a coordinate / € \N'\, and a degree bound 
£ < dist(C0/2), we define the (-degree influence of coordinate i on f as 

Inf f (/) = Yj f^^^ ■ 

(Here, refers to the i-th coordinate of the unique minimum-weight representative of 
the equivalence class /3.) 

6.1 Majority is Stablest 

In this section, we show an analogue of the majority is stablest theorem of [MOO05] 
on the e-noise graph on just as Theorem 5.6 showed an analogue of the majority is 
stablest theorem over the Boolean noise graph. 

Theorem 6.4. For every e,S,t > 0, there exists L, d, r such that ifG denotes the graph 
CayiD' constructed using Reed-Muller codes of degree d, then for every function 
f-.jy^ [0, 1] with max,g[Ar] Inf^^(/) < t, 

(/,G/><rp(//) + 5, (6.1) 

where p - e~^, p - ^x-d' lf(^)] '^^d Fp : R R the noise stability curve over 
Gaussian space. 

Given the characterization of the spectrum of Cay(D^ Tt^e) (Lemma 6.2), the proof 
of Theorem 6.4 is similar to that of Theorem 5.6. For the sake of completeness, we 
include a proof sketch in the appendix - see Section A. 1. re. 

6.2 2-Query Test 

We will now describe a dictatorship test for functions on &, analogous to the 2-query 
dictatorship test on e-noise graph. 

We are interested in functions f:D'^Q where Q = ¥'^. Note that v € can also 

be thought of as j; e 2^. For all /3 € Fj, the jS''' dictator function xp from £)' c to Q 
is given by, 

Xfiic) = cp 

Clearly, the dictator functions are linear functions over D', i.e..,xi3{c+c') = X/}i(^)+X/}ic')- 
This linearity is used to perform the 2-query test via. folding. Note that for each a e Q, 
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the constant function a(x) - a for all x e F", belongs to the code We v/ill fold the 
function by enforcing that for all a e Q, f(c + a) = f(c) + a for all a e Q. 
The details of the 2-query dictatorship test is described below. 



DICT 

Input: f:2y^Q 

Folding. The function is assumed to satisfy f{c + r) = /(c) + r for every c e £)' and 
r e Q. This is enforced hy folding the table of the function /. 

- Sample a vertex c e D^ 

- Sample a neighbour c' e D' of the vertex c in the Cayley graph Ca.y(D', 7~(,e). 

- Sample r e Q uniformly at random. 

- Accept if f{c + r) - r = f{c') 



Given a function / : ^ Q, we can arithmetize the value of the test in terms of Q 
functions {fa}a€Q that are defined as 

fa{x) = I[fix) = a] . 

Due to folding, we have fa(x) - fa+r(x+r) for all r e Q. For each a e Q, the expectation 
of fa is given by, 

E /„(c)= P [/(c + r) = a] = i 

where we used the fact that / is folded. The probability of acceptance of the 2-query 
test can be written in terms of the functions fa as follows: 

P[Test accepts/] = y E [fabric + r)fa(c')] = V E [fa(c)fa(c')] .. 

^{c,c')~Csiy(!D',r,X ^{c,c')~Cay(!D',T,X 

where (c,c') ~ Cay(£)^ 7~f,g) denotes a uniformly random edge in the graph 
Cay(£)^nJ. 

Theorem 6.5. The 2-query dictatorship test DICT described above satisfies the follow- 
ing completeness and soundness, 

- (Completeness) Every dictator function Xpi^) — accepted by the test with 
probability at least I - s. 



(Soundness) For every 6 > 0, there exists t,L such that if f satisfies 

; Q then f is acc 



max,g[;v] Inf f ^(/a) ^ ^for all a e Q then f is accepted with probability at most 



where p — e 
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Completeness. Recall that for a c e C""" generated from distribution T^, for each x e 
F2 (see Lemma 4.5) 

P [c(x) = 0] > 1 - 0(e) . 

It is easy to see that by construction, this property holds for the distribution 71_£ also, 
namely, 

P \c{x) - 0] ^ 1 - 0(e) . 

Hence for a random edge (c, c') in the Cayley graph Cay(2)', T^.e) and an /? e c{jS) = 
c'iP) with probability 1 -e. Therefore, for each p e V^, the jSth dictator function satisfies 
the test with probability 1 - 0(e). 



Soundness. The probability of acceptance of the 2-query test is given by, 
Pr[Test accepts/] = V E [/„(c)/„(c')] 

^(c,c')~Cay(£)',r,,) 

By applying Theorem 5.6, there is an appropriate choice of L, r such that if 
max,-6[7V] Inf 5^(/a) < r for all a then the probability of acceptance can be bounded by 

P[Test Accepts] = Y,iU, Gfa) <Q-Tp(^) + 6, 

aeQ 

where p - e''^ and G = Cay(D', Tt^s). The conclusion follows. 



7 Efficient integraUty gaps for unique games 

In this section, we present constructions of SDP integrality gap instances starting from 
a code C along with a local tester. To this end, we make an additional assumption on the 
code C. Specifically, let us suppose there exists a subcode of D = C""" with distance 
^. Formally, we show the following result. 

Theorem 7.1. Let C be an [N, K, D]2 linear code with a canonical tester T as described 
in Definition 3.2. Furthermore, let be a subcode of T) = with distance 5. Then, 
there exists an instance of unique games, more specifically a '7/-Max-2Lin instance, 
whose vertices are T) (\T)\ — 2^~^) and alphabet "H such that: 

— The optimum value of the natural SDP relaxation for unique games is at least 
(1 ~ ^) where t is the number of queries made by the canonical tester T. 

- No labelling satisfies more than 

( ] 

min 1 - 2s(k) + 

fraction of constraints. 
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Instantiating the above theorem with the Reed-Muller code and its canonical tester 
we obtain the following explicit SDP integrality gap instance. 

Corollary 7.2. For every integer n, 5 > there exists a F2-Max-2Lin instance T on 
M - 2^'°^ " vertices such that the optimum value of the SDP relaxation on T is \ - 
^^ iog(i/(?) .^ = 1-0 ^ 2a°gtogMti/2 ) ^'^'^^ every labelling ofY satisfies at most 0{6) fraction 
of edges. 

Proof. Fix the code C to be the Reed-Muller code RM(?i, n - log n) of degee d = log n 
over n variables. The block length of the code is N = 2", while the rate is A' = 2" - 
^ 2" - 0(2'°s""). This code contains the Hadammard code which is of 
relative distance ^. 

Let Trm denote the canonical Reed-Muller tester for RM(w, n - log n), and let 
denote the XOR of r-independent tests. Let us fix r = 1001og(l/5), thus yielding a 
canonical tester making t = log(l/(5) • 2""'' queries. By the work of [BKS^IO], this 
tester has a soundness of at least s{k) = ^ - (1 - k/2'^'^^y/2. With k - 2'^/10, the above 
soundness is at least s{k) > 1/2 - 6/2. Using Theorem 7.1, the optimum value of the 
resulting F2-Max-2Lin instance is at most S. On the other hand, the SDP value is at least 

(1 - 2t/Nf = 1 - 1001og(l/5)2«-'^/2" = 1 - ol'^] = 1 - i^i^ . 

\ 2" / n 

□ 

Starting from C, we construct an SDP integrality gap instance T{C, T) for unique 
games as described below. 



The vertices of are the codewords of D. The alphabet of the unique games instance 
r(C, T) are the codewords in TY. The constraints of unique games instance F(C, T) 
are given by the tests of the following verifier. 

The input to the verifier is a labeling { : D ^ "]-(. Let us denote by 7? = \1-(\. The 
verifier proceeds as follows: 

- Sample codewords c € D and h,h' e'H uniformly at random. 

- Sample a codeword q € D from the tester T. 

- Test if 

iic + q + h)- i{c + h') = h-h' 

SDP Solution. Here we construct SDP vectors that form a feasible solution to a natural 
SDP relaxation of unique games [KV05]. 
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Maximize E E 

c€D,h,h'€'H qeT 



^ '^(^c+h',e+h',bc+q+h,Uh) 



Subject to<&c,ft, bc,h') - 
{bc,h,bc',h'} > 
J]{bc,{,bc,f}^R 



(7.1) 

\fce!D,h*h' e'H (7.2) 
\/c,c' e!D,h,h' e9i. (7.3) 
Vce£) (7.4) 



For a vector c e F^, we will use (-l)'^ e R'" to denote the vector whose coordinates 
are given by (-1)? - (-1)"^' . For a pair of vectors c, c', we have 

<(-l)^(-l)<^') = l-2A(c,c'). 

For each vertex c € D associate vectors {6c,/, - (-1)'^"^'' ® (-lY'^'^lh e 9-1}. Notice 
that for a pair of vectors 6c,ft> bc',/i' we have, 

{brh, bc'h'} = = (1 - 2A(c + h,c'+ h')f . 



Since the distance of the code "?/ is ^, we have 



{bc,h,bc,h')^{'^-2m,h')f^ 



1 1 if /z = ^i' 
if/z^/z' 



(7.5) 



In other words, for every vertex c, the corresponding SDP vectors are orthonormal. The 
objective value of the SDP solution is given by. 



OBJ = E E 



E E 



p y^^{bc+h',{+h',bc+q+h,f+h} 

- - 2A(c + h' +€ + h',c + q + h + i + h)f 



E E f(l-2A(0,^))2l 

> 1 

where t is the number of queries made by the canonical tester T for C. 

Soundness. Let { : D ^ H he an arbitrary labelling of the Unique Games instance 
T(C,T). For each p e'H, define a function : £) ^ [0, 1] as follows, 

fp{c)= E [I[^(c + /i)=p + /i]] . 
The fraction of constraints satisfied by the labelUng ( is given by. 



OBJ = E E 

c€D,h,h'€'H q€T 



2^ + h')^p + h'] ■ I[e{c + q + h)^p + h] 
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= E E 



= E E 

ceO qeT 



y E Wic + h') = p + h'] - E I[{{c + q + h) = p + h] 



pe'H 



he'H 



fp(c)fp{c + q) 



pe'H 



pe'H 

where G - Cay(C"'", 7") is the graph associated with the code and tester T. 
The expectation of the function fp is given by, 

E fJc) = P U{c + h) = p + h] 

CSV ' ceDMe'H 

P \£ic) = p + h] because (c + h,h) ~ (c, h) 
1 _ 1 

W\~R' 

Since fp is bounded in the range [0, 1] we have, 

{fpjp) = E [fpicf] < E [fpic)] = i . 

ce2J ceD K 

Applying Corollary 4.10, we get that for each p, 

1 



(7.6) 
(7.7) 



(fp^GfpX- - min \ l-2sik) + 



Rl/2 



Substituting the previous equation in to (7.7), we get that the fraction of constraints 
satisfied by i is at most 



min ll-2s{k) + ^] 



8 Hierarchy integrality gaps for Unique Games and Related 
Problems 

This section is devoted to the construction of a integrality gap instance for a hierarchy 
of SDP relaxations to Unique Games. More specifically, we consider the LH^ and SA^ 
SDP hierarchies described in [RS09]. For these SDP hierarchies, we will demonstrate 
the following integrality gap constructions. 

Theorem 8.1. For every £,6 > 0, there exists an F2-Max-2Lin instance I for some 
positive integer t, such that no labelling satisfies more than 6 fraction of edges of Y 
while there exists an SDP solution such that, 

- the SDP solution is feasible for LHr with R = exp(exp(n(loglog^^^ A^))). 

- the SDP solution is feasible for SAr with R - exp(Q(log log'^^ A'^)). 
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- the SDP solution has value 1 - 0{e). 
where N is the number of vertices in the instance I. 

Remark 8.2. Composing the above SDP integrality gap with Unique games based 
hardness reductions yields corresponding gap instances for several classes of problems 
like constraint satisfaction problems (CSPs) and ordering CSPs like maximum acyclic 
subgraph. Specifically, up to exp(exp(Q(log log'^^ A/^))) rounds of LH hierarchy or the 
exp(n(log log'''^ N) rounds of the SA hierarchy can be shown to have te same SDP inte- 
grality gap as the simple SDP relaxation for every CSPFor the sake of brevity, we omit 
a formal statement of this result here. 

Towards showing Theorem 8.1, we follow the approach outlined in [RS09]. At a 
high-level, the idea is to start with an integrality gap instance T for a simple SDP relax- 
ation for unique games over a large alphabet. The instance F is reduced to an instance 
^£,Q,d^) of unique games over a smaller alphabet using a reduction similar to Khot et 
al. [KKMO07]. Moreover, the SDP solution to the simple SDP relaxation of F can be 
translated to a solution for several rounds of SDP hierarchy for ^e,q4(X)- 

Let F be an instance of F2-Max-2Lin over a set of vertices V(Y) and edges ^CF). 
On every edge {u, v) € ^(F), there is a constraint of the form u - v - for some 
a € F'^. We will reduce F to an instance of 2-Max-2Lin instance using the two query 
test described in Section 6. 

Translations. Notice that Reed-Muller codes are invariant under translation of its co- 
ordinates. Therefore, the code & and the test distributions 77_e are both invariant under 
translation. Formally, for an or e Fj, the translation operator : is defined 

by 

{Ta o c)p = cp^„ Vc € e^,yS € F« . 

Given a codeword c e 2)', we have o c e D' . 

We are now ready to describe the reduction from F to an instance of Fj-Max-ILin. 

The vertices of *£,Q,rf(F) are V(F) x D'. Let { : y(F) xD' ^ Qhea. labelling of the 
instance *Pe_Q rf(F). 

Folding. The labelling £ is assumed to satisfy {{v, c + r) = £{v, c) + r for every vertex 

V e V(T), c e ly and r e Q. This is enforced by "folding". 

The constraints of ^e,g,rf(F) are given by the queries of the following verifier. 

- Sample a vertex u € V(T) uniformly at random. Sample two neighbours 1^1,^2 £ 
N{u) of u uniformly at random. Let the constraint on the edge (m, u,) be Vi-u - a, 
for /€ {1,2). 

- Sample an element c\ € uniformly at random, and sample a neighbour C2 € 
& of ci in the graph Cay(2)', Tt,s)- 

- Sample an element r e Q uniformly at random. 

- Test if i{vi,{Tai o a) + r) - r = ({v2, Ta^ o C2). 
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Soundness. 



Lemma 8.3. For all sufficiently small constants e, (5 > and all choices of Q - 2', 

there exists y, d such that if no labelling ofY satisfies more than y fraction of edges, then 
every labelling of^s,Q4i^) satisfies at most QFp (Vg) + S fraction of constraints, where 
p - e~^. 

Proof. Let £ : V x D' ^ Q be a. labelling of the instance ^£,Q,rf(r). For each vertex 

def 

V € V{r), let F" : D' ^ Q denote the labelling £ restricted to the vertex v, i.e., F"{c) = 
{(v, c). For each vertex v e y(r) and ^ € Q define f^ : & [0, 1] as 

/;(c)1l'l[nc) = ?]. 

Due to folding we have f^{c) = Z^+^Cc + r) for all r e Q. Moreover, this implies that 
Eceor f^ - ^- Finally, for a vertex u e V(T) and r e Q define. 



veN(u) 



Clearly, for the functions h" also we have. 



E< = - \/ueV(r),reQ 
p Q 



(8.1) 



The probability of acceptance of the verifier can be arithmetized in terms of the 
functions h". 



P[verifier accepts] 



= E E E E 

Mey(r) vuV2eN(u) ci,C2&Cay{iy,T,,s) reQ 



YjfMTa,oc,+r)fl^{Ta,oc2) 



= E E E 



J]f^^(Ta,oc,)f^^{Ta,oC2) 



q^Q 



(folding) 



= E E 

Mey(r)ci,C2eCay(£)',7-,,^) 



= E E 

«eV(r)ci,C2eCay(£>',r,,^) 



E 

u&V(T) 



q^Q 



q^Q 



(where// = Cay(£)SnJ) 
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Suppose the probability of acceptance of the verifier is at least Q ■ Tp{^/Q) + 6. By simple 
averaging, for at least 6/2 fraction of the vertices u e V{T) we have, 



Let us refer to such a vertex u as being good. 

Fix the parameters t, L, d to those obtained by applying Theorem 6.4 with parame- 
ters s,6llQ. Recall that by (8.1), we have E2)t[/i^] = ^. Applying Theorem 6.4, if for 

each q ^ Q, maxaev" Inf^'(/io) < t then, 



Y^{h"^,Ghp<QY,iyQ) + Q.:^, 



This implies that for each good vertex u there exists q, a such that Inf ^^(/j^ > r. We 
will use these influential coordinates to decode a labelling for the F2-Max-2Lin instance 

r. 

For each vertex v e V(T) define the set of influential coordinates as, 

S, = {ae F'^l Inf^^(/jp > T/2for some q e Q]U{a e F'^| Inf^^(/^") > T/2for some q e Q} 

(8.2) 

Using Lemma A.l, for each of the functions /i^ or there are at most 2L/t coordinates 
with influence greater than t/2. Therefore, for each vertex v the set 5 „ is of size at most 
2 • e • 2L/T - 4QL/T. 

Define an assignment of labels A : V{T) as follows. For each vertex v, sample 

a random or e S „ and assign A{v) = a. 

Fix one good vertex u, and a corresponding q, a such that Inf ^^(/j^ > r. By defini- 
tion of h'^ this implies that 

\veN(u) ^ I 

which by convexity of influences yields, 

T< E [Inff(r„„„o/")]= E rinf^^^^J/;)]. 

Hence, for at least a t/2 fraction of the neighbours v e N(u), the coordinate a - auv has 
influence at least t/2 on Therefore, for every good vertex u, for at least t/2 fraction 
of its neighbours v e N{u), the edge (u, v) is satisfied by the labelling A with probability 
at least jj^i ^ t^/\6Q^L^. Since there are at least a (5/2-fraction of good vertices u, 

c 2 r 3 

the expected fraction of edges satisfied by the labelling A is at least | • ^ • j^^jj = 64^277- 

By choosing the soundness y of the outer unique game F to be lower than ^^^2^2 
yields a contradiction. This shows that the value of any labelling { to ^s,Q,d(^) 1^^^ 
thanerp(l/0 + <5. 

n 
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SDP Solution. We will construct feasible solutions to certain strong SDP relaxations 
of ^e,Q,di^) by appealing to the work of [RS09]. The SDP hierarchies that we consider 
are referred to as the LH and SA hierarchies. Informally, the r''' -level LH relaxation (LH^) 
consists of the simple SDP relaxation for unique games augmented by local distributions 
fis over integral assignments for every set S of at most r vertices. The local distribution 
fis is required to be consistent with the inner products of the SDP vectors. Alternately, 
this SDP hierarchy can be thought of as, the simple SDP relaxation augmented by every 
valid constraint on at most r vertices. 

The SA hierarchy is a somewhat stronger hierarchy that requires the local distribu- 
tions jUs to be consistent with each other, namely, jjs and juj agree on 5 n T. Alternately, 
the SA hierarchy corresponds to the simple SDP relaxation augmented with r-rounds of 
Sherali-Adams LP variables. We refer the reader to [RS09] for formal definitions of the 
SA and LH hierarchies. 

Lemma 8.4. Suppose F has an SDP solution that of value 1 - 77, then there exists an 
SDP solution to the instance ^E,Q,d(y) such that, 

- the SDP solution is feasible for LHr with R - 2"^'^''''"'*). 

- the SDP solution is feasible for SAr with R = Q.{e/?]^^^). 

- the SDP solution has value 1 - 0{e) — o,;(l) on ^e,g,rf(r). 

Proof. This lemma is a direct consequence of Theorem 9 from [RS09]. 

In [RS09], the authors start with an integrality gap instance F for the simple SDP 
for unique games, and then perform a traditional long code based reduction to obtain an 
instance q(F). 

The crucial observation is the following. 

Observation 8.5. The vertices of *I'e,g,rf(r) are a subset of vertices of q(F) - the 
instance obtained by the traditional Q-ary long code reduction on F. 

Proof. The vertices of ^s.Q.di^) are pairs of the form (v, c) where v e V{r) and c € D'. 
The codeword c e & can be thought of as a string of length N = 2" over the alphabet 
Q = Fj, namely, c € Q^" . The vertices of the instance S)^ q(F) obtained via a traditional 
Q-ary long code reduction is V{T) x . Hence the observation follows. □ 

In [RS09], the authors construct an SDP solution for the instance g(F) that is fea- 
sible for LHr relaxation with R = 2^'-'' and for SA^ relaxation with R = Q.(E/r]^''^). 
As noted in 8.5, the vertices of *Fe_g(F) are a subset of the vertices of 0£_g(F). Therefore, 
the same SDP solution constructed in [RS09] when restricted to the instance ^e,Q,d(F) 
yields a feasible solution for the corresponding LHr and SAr relaxations. 

To finish the proof, we need to show that the value of the SDP solution from [RS09] 
is 1 - 2e - Ofjil). 

The traditional long code based reduction to get <I>c,q(F) uses the noise stability test 
as the inner gadget. Namely, to test if a function / : Fg — > Fg is a dictator function, 
the verifier picks x € uniformly at random, and rerandomizes each coordinate of x 
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independently with probability s, and then tests if f{x) - f{y). Composing this noise 
stability test with the outer unique game T yields the instance Og Q(r). The value of the 
SDP solution constructed for <I>e,Q(r) in [RS09] depends only on the expected hamming 
distance between the queries x,y. More precisely, in Claim 2 of [RS09], the authors 
show that if the distribution on the queries (x, y) e Fg x Fg is chosen to be an arbitrary 
distribution NS over F^" X F^ , the SDP objective value of the solution is given by 

F [xe = ye]-e. 

{x,y]~NS,{e[2"] 

The instance *I'e,g,rf is obtained by using the following distribution of x, y over F^ x 
Fg - sample (ci , cj) an edge in Cay(2)', Tt^s)- 

By construction, for any coordinate £ € [2"], P[xc = yc = I - 0(e). Therefore,using 
Claim 2 of [RS09], the SDP objective value on the instance *Pe,g,rf(r) is at least 1 - 
0(2e)-o,(l). 

□ 

Proof of Theorem 8.1. Fix t = riO/elog(l/(5)l and Q = 2^. By our choice of Q, we have 
QTe-e{l/Q) < 6 (see Appendix B in [MOO05] for such asymptotic bounds on F). 

Fix 7, fif depending on e, 5 and Q as dictated by Lemma 8.3. Let F be the Unique 
games instance obtained by Corollary 7.2 with the optimal integral value set to y. In par- 
ticular, F is a F2-MAX-2L1N instance that has M = 2^'°*^ " vertices. Its SDP optimum for 
the simple Unique games SDP relaxation is at least 1 - 0{C{e, 5) /n) (77 = 0{C{e, 6) jn)) 
for some constant C(e, 6) depending on e, 6. 

Now we apply the reduction to F2-Max-2Lin outlined below to obtain an instance 
*£,e,d(r)- The number of vertices of the instance ^e,Q,d(F) is \V{Y)\ x Note that the 
choice of the degree <i is a constant (say d{E, 6)) depending on e, 6. Hence, the number of 
points in & is given by = 2'^^"'"''''"^\ Therefore, the number of vertices of ^e,g,£/(F) 

is = 22'"^'" • 20("*'') = 22""°^^'"'. Equivalently, we have n - 2"(i°gi''g"'^). 

- By Lemma 8.3, the optimal labelling to ^£,Q,d(F) satisfies at most QTe-s{l/Q) + 
5 - 0{S) fraction of constraints. 

- By Lemma 8.4, there exists an SDP solution to the instance ^e,Q,t;(F) with value 
1 - 0(e) - Ot^{\). Since 77 - 0{C{E,6)/n), for large enough choice of n, the SDP 
value is at least 1 - 0(e). 

- The SDP solution is feasible for LHr for R = 2"^'^'''?"") = 2^(^-''^""' = 
exp(exp(Q(loglog'^2^))) rounds, where c{£,6) is a constant depending on e and 
(5. Furthermore, the SDP solution is also feasible for SAs for 7? - D.{e/t]^^'^) = 
c{E,d)n^''^ = exp(a(loglog^/2^)). 

□ 
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A Missing Proofs 

Proof of Theorem 5.6. Suppose d > Clog(l/T) for C to be chosen later and fix y < 1/8 
for y to be chosen later. Let € = log(l/T)/4ci < t^2''+^, where ci is the constant from 
Theorem 5.7. For a eF^ /C, let Aa be the eigenvalues of G. Then, by Lemma 4. 13, 

\Aa - < T, for k<,l, \Xa\ < p'^'^, for k>£. (A.l) 

Let g - G^/ and G' - G^~^^. Then, the graph G' has the same eigenfunctions as G -Xa 
for a e /C with eigenvalues A'^ = a\, From the above equation, it is easy to check 
that, for p' = p^"^^, 

K - (P'/I < V^, for k<{, \A'J < (p'Y'^, for k>i. (A.2) 

Further, as the eigenvalues of G are each at most 1 , the coordinate influences of g are no 
larger than those of /. 

Now, decompose g - g^^ + g^'^ into a low-degree part g^^ - XaeF^, wtca^f gio')Xa 
and a high-degree part - I^cyevyc, Ma,C)>e g{a)Xa- Then, 

</, Gf) = (g, G'g) - (^^^ G' g""') + {g'^' , G' g^') < (^^^ G'^^') + • max 4 . 

Q'eF?'/C, A(a,C)>f 

Hence, using Equation (A.2) (and the crude bound < 1), 

</,G/>= 2 (pT'^'W +(p'/+V?. (A.3) 

QrEF^,wt(a-)<^' 

Observe that g^'^ is a multilinear polynomial of degree at most t and as the ^-degree 
influences of g are at most r, g'^'^ is r-regular. 

Let S c {±1}^ be the set of {+l}-vectors corresponding to the Reed-MuUer code 
C""" = RM(?i,<3f), that is, for every codeword c e C""", the set S contains the vec- 
tor ((-1)'^', . . . , (-1)'^'"). Then, as g is [0, l]-valued on and ^ measures distance to 
bounded random variables, by Equation (A.l), 

E[^o^<^(z)] < F\{g{z)-g''\z)f\ = E[(9>'(z))'] - B[{G^f^'{z)f] < 

max {Alf < p^^. 

Qr:|ff|>f 

Hence, by Theorem 5.7 (recall that £ = log(l/T)/4ci), 

E o g^'ix)] < E o g^\z)] + 2^(^) < p^^ + r'/^ . 
x~[±lf z~S ^ „ 

Now, as C""" is ^-wise independent {£ < 2'^''"'), 

E Ig^'ix)] = B [g^'{z)] = E [g{z)] ± E [{g>'{z)f]"^<fi+ ^. 

x~[±lf z~S z~S z~S 
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Therefore, by Corollary 5.3, 

(9(l0gl0g(l/77)) 

<r , Tp,g- )= 2^ ip) ' ^g{a) < Tp,(M + ^/^) + (A.4) 

Since rp>(p + ^JT}) < Tp>{ji) + 2^/ri and Tp(p) < rp>(p) + \p-p'\/{l - p) (cf. Lemma B.3, 
Corollary B.5 in [MOO05]), it follows from Equations (A.3) and (A.4) that 

(/, Gf) = (g, G'g) , r.^u) . O (^) . 0( V^) . ^ P^^^^^^ - V? 

\ 1 - P / (1 -p)log(l/?7) 

\ 1-P (1 -p)log(l/77)/ 

(Here we used the estimate \p - p'\ = |p - p^'^^l = 0{y\og{l/p)).) By choosing 
d > Clog(l/T) and y - C^loglog(l/T)/(log(l/T)log(l/p)) for an appropriately large 
constant C, the above expression simplifies to 

/frfs^rr , ^ OOoglogOAr)) 

</, Gf) < TpOu) + — — — . 

(1 -p)log(l/T) 

□ 

Proof of Lemma 5.15. Since X fools r-regular degree-f PTFS, we have for all u ^ 0, 

o Q{X) >u}-¥{^o Q{Y) >u} < 0{s) . 

By hypercontractivity and 20£-wise independence of X, 

P o Q{X) > mK P {12(^)1 > Vm) < E Q{Xf° < M-"'2^^''> . 
Since ^ o Q(X) is a non-negative random variable, 

Efoem = /PKoe,x)>„id„ 

Hence, we can bound its expectation 

E^o2(X)= I P o Q(X) > u} du 

Ju>0 



= f P o e(X) > m) dw ±2^^^^ [ M"'°dM 

= 1 P 1^ ° > m) dw ±0 (sM + 2^<^Vm'^) 
= E ^ o e( F) ±0(£M + e^^^''^/M'') . 



(In the last step, we used that P o Q{Y) > m} < m io20(0^ ^ consequence of hyper- 
contractivity.) Choosing M = I'^^^^s^ '^ (so that eM - I'^'-^^M'^), we conclude that 
E^o 2 = E^o e±e<'-''2'^(^). □ 
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A.l Proofs from Section 6.1 

The following lemma shows a bound on the sum of influences. 

Lemma A.l. For a function f:D' R and t < dist(C0/2, the sum of (-degree 
influences off is at most J^ic^iN] I^iff^C/) ^ ^ 

Proof. The usual identity for the total (low-degree) influence holds, 

2lnff(/)= 2 wtOS)/W<^V/. □ 

Analogous to Theorem 5.7, the following invariance principle can be shown for reg- 
ular multilinear polynomials. 

Theorem A.2. Let N = 2" and t be an integer. For every T,i > 0, there exists C such 
that for d > Clog(l/T) the following holds: if P : R^' Rbe a T-regular polynomial 
of degree at most t then, for x €„ {±1}^', z RM(?i, J)', 

|E[^ o P(x))] - o P{zm < 2^'' V^, 

for a universal constant c\ > 0. 

The proof follows easily from the proof of Theorems 5.9 and 5.7 and the fact that 
if RU{n,d) satisfies the properties of the PRG in [MZIO], then so does RM(?i,J)'. We 
omit the proof. 

The work of Mossel et al. [MOO05] also obtains bounds on noise stability of func- 
tions over product spaces of large alphabets namely Q'^ . The following corollary is a 
consequence of Theorem 4.4 in [MOO05]. The proof is analgous to that of CoroUai^y 5.3 
from Theorem 5.1. 

Corollary A.3. Let f : ^be a function with 'Ef - ji and E ^ o / < r. Suppose 

jjjf.y^30iog(i/r)/iogQ ^ j for all i € [A^]. Then, 

where Tp is the noise graph on with second largest eigenvalue p and Tp is the Gaus- 
sian noise stability curve. (Here, we assume that r is small enough.) 

Now we are ready to present the proof of the majority is stablest theorem over 
(Theorem 6.4) using Theorem A.2 and Corollary A.3. 

Proof of Theorem 6.4. Let Q -2' . Fix d > C log(l/T) for a sufficiently large constant C 
to be chosen later. Let y < 1/8 be a constant depending on e, 6 whose value will be cho- 
sen later. Let f - log(l/T)/4ci < t^2''^\ where ci is the constant from Theorem A.2. 
For a e /C, let Aa be the eigenvalues of G. Then, by Lemma 6.2, 

1^ - P™'^°'^l < T, for wt(Qr) < £, \Aa\ < p^^^"\ for wt(Qr) > £. (A.5) 
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Let g - C"/ and G' - Then, the graph G' has the same eigenfunctions sl?,G -Xa 

for a e Q'^ /C with eigenvalues A'^ - A^. From the above equation, it is easy to check 
that, for p' ^p^-^T, 

U'a - (fi'f^^"^\ < V^, for wt(a) < e, \A'J < (fi'f^^"\ for wt(a) > £. (A.6) 

Further, as the eigenvalues of G are each at most 1, the coordinate influences of g are 
no larger than those of /. Now, decompose g = g^^ + g^^ into a low-degree part g^^ = 
I^aeQ^. wt{aw gia)Xa and a high-degree part g"'^ ^ I^aeQ'^/c, m^X ^(«k£»- Then, 

</, Gf) - (g, G'g) = (g^', G'g^') + {g>', G'g^') < {g'^' , G'g^') + ■ max A',. 

aeQ'^/C, deg(a)>e 

Hence, using Equation (A.6) (and the crude bound //<!), 

{f,Gf}= (Py"'"'g{af +{p'fWU^. (A.7) 

ffee"/C,wt(ff)<f 

Observe that g'^'^ is a multilinear polynomial of degree at most ^-f. Since the /'-degree 
influences of g are at most t, it implies that the multilinear- polynomial g'^'^ is r-regular. 

Let S c {±1}'^' be the set of {±l)-vectors corresponding to the Reed-Muller code 
D' , that is, for every codeword c - {c^^\ c^'^\ . . . , c^*^) € the set S contains the vector 

((-l)'^i , . . . (-1) ' , (-l)''v ). Then, as g is [0, l]-valued on £) and ^ measures distance 
to bounded random variables, by Equation (A.5), 

E[^o^<^(z)] < ^ \{g{z)-g''\z)f^ = E [((?>^(z))2] = E [(G^/>^(z)f] < 



max {AlT < p' 

Q':wt(a)>^ 



Hence, by Theorem A.2 (recall that £ = log(l/r)/4ci). 



.v~{±l)" z~S ^ 

n-= 

Now, as ly is ^-wise independent {{ < l''^^), 

E [g^\x)] = BJg'^'iz)] = E [g(z)] ± E [ig''\z)f]''^<fi+ ^JT|. 

;c~{±l)" Z~S Z~S Z~S 

Therefore, by Corollary A.3, 

/ <f 7. <A V / 'Nwt(ff)^. ^2^^ ^ 0(floglog(l/T)) 

{g ,Tp>g >= 2, (P) V«) <^P'(M+ + n - o01og(l/T) - ^ ^ ^ 

Since Fp^// -1- s/rj) < Fp'(//) 4-21/^ and TpQu) < Fp'(/i) 4- |p -p'|/(l -p) (cf. Lemma B. 3, 
Corollary B.5 in [MOO05]), it follows from Equations (A.7) and (A.8) that 

</,G/> = (g,G'g) < Fp(,) + . 0( V^) . ^"g/^g^//;^^^ ,,n«i-..)./o , ^ 

\ 1 - P / (1 -p)log(l/T) 

By a sufficiently small choice of r, and fixing ^ = log(l/T)/4ci and y = 
100fciloglog(l/T)/(log(l/T)log(l/p)) (so thatp"(^^/'^ < l/log(l/T) and |p - p'| = 
0( log 1/p'iog i/rW' the error term in the above expression can be made smaller than 6. 

□ 
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